Odorant receptors and uses thereof

ABSTRACT

The invention provides an isolated nucleic acid molecule encoding an odorant receptor. The invention also provides expression vectors containing such nucleic acid, purified odorant receptor proteins encoded by the nucleic acid, and transfected cells expressing the receptor proteins. The invention further provides methods of identifying odorant ligands and odorant receptors, of developing fragrances, of identifying appetite suppressant compounds, of controlling appetite, of controlling pest populations, of promoting and inhibiting fertility, and of detecting odors.

[0001] This application is a continuation-in-part of U.S. Ser. No. 08/129,079, filed Oct. 5, 1993, a national stage filing of PCT International Application No. PCT/US92/02741, filed Apr. 6, 1992, which claims priority of and is a continuation-in-part of U.S. Ser. No. 07/681,880, filed Apr. 5, 1991, now abandoned, the contents of all of which are hereby incorporated by reference.

[0002] The invention disclosed herein was made with Government support under grant number RO1-CA 23767 from the National Institutes of Health, U.S. Department of Health and Human Services. Accordingly, the U.S. Government has certain rights in this invention.

BACKGROUND OF THE INVENTION

[0003] Throughout this application, various publications are referenced by Arabic numerals within parentheses. Full citations for these publications may be found at the end of the specification immediately preceding the claims. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art as known to those skilled therein as of the date of the invention described and claimed herein.

[0004] In vertebrate sensory systems, peripheral neurons respond to environmental stimuli and transmit these signals to higher sensory centers in the brain where they are processed to allow the discrimination of complex sensory information. The delineation of the peripheral mechanisms by which environmental stimuli are transduced into neural information can provide insight into the logic underlying sensory processing. Our understanding of color vision, for example, emerged only after the observation that the discrimination of hue results from the blending of information from only three classes of photoreceptors (1, 2, 3, 4). The basic logic underlying olfactory sensory perception, however, has remained elusive. Mammals possess an olfactory system of enormous discriminatory power (5, 6). Humans, for example, are thought to be capable of distinguishing among thousands of distinct odors. The specificity of odor recognition is emphasized by the observation that subtle alterations in the molecular structure of an odorant can lead to profound changes in perceived odor.

[0005] The detection of chemically distinct odorant presumably results from the association of odorous ligands with specific receptors on olfactory neurons which reside in a specialized epithelium in the nose. Since these receptors have not been identified, it has been difficult to determine how odor discrimination might be achieved. It is possible that olfaction, by analogy with color vision, involves only a few odor receptors, each capable of interaction with multiple odorant molecules. Alternatively, the sense of smell may involve a large number of distinct receptors each capable of associating with one or a small number of odorant. In either case, the brain must distinguish which receptors or which neurons have been activated to allow the discrimination between different odorant stimuli. Insight into the mechanisms underlying olfactory perception is likely to depend upon the isolation of the odorant receptors, and the characterization of their diversity, specificity, and patterns of expression.

[0006] The primary events in odor detection occur in a specialized olfactory neuroepithelium located in the posterior recesses of the nasal cavity. Three cell types dominate this epithelium (FIG. 1A): the olfactory sensory neuron, the sustentacular or supporting cell, and the basal cell which is a stem cell that generates olfactory neurons throughout life (7, 8). The olfactory sensory neuron is bipolar: a dendritic process extends to the mucosal surface where it gives rise to a number of specialized cilia which provide an extensive, receptive surface for the interaction of odors with olfactory sensory neurons. The olfactory neuron also gives rise to an axon which projects to the olfactory bulb of the brain, the first relay in the olfactory system. The axons of the olfactory bulb neurons, in turn, project to subcortical and cortical regions where higher level processing of olfactory information allows the discrimination of odors by the brain.

[0007] The initial events in odor discrimination are thought to involve the association of odors with specific receptors on the cilia of olfactory neurons. Selective removal of the cilia results in the loss of olfactory response (9). Moreover, in fish, whose olfactory system senses amino acids as odors, the specific binding of amino acids to isolated cilia has been demonstrated (10, 11). The cilia are also the site of olfactory signal transduction. Exposure of isolated cilia from rat olfactory epithelium to numerous odorant leads to the rapid stimulation of adenylyl cyclase and elevations in cyclic AMP (an elevation in IP3 in response to one odorant has also been observed) (12, 13, 14, 15). The activation of adenylyl cyclase is dependent on the presence of GTP and is therefore likely to be mediated by receptor-coupled GTP binding proteins (G-proteins) (16). Elevations in cyclic AMP, in turn, are thought to elicit depolarization of olfactory neurons by direct activation of a cyclic nucleotide-gated, cation permeable channel (17, 18). This channel is opened upon binding of cyclic nucleotides to its cytoplasmic domain, and can therefore transduce changes in intracellular levels of cyclic AMP into alterations in the membrane potential.

[0008] These observations suggest a pathway for olfactory signal transduction (FIG. 1B) in which the binding of odors to specific surface receptors activates specific G-proteins. The G-proteins then initiate a cascade of intracellular signaling events leading to the generation of an action potential which is propagated along the olfactory sensory axon to the brain. A number of neurotransmitter and hormone receptors which transduce intracellular signals by activation of specific G-proteins have been identified. Gene cloning has demonstrated that each of these receptors is a member of a large superfamily of surface receptors which traverse the membrane seven times (19, 20). The pathway of olfactory signal transduction (FIG. 1B) predicts that the odorant receptors might also be members of this superfamily of receptor proteins. The detection of odors in the periphery is therefore likely to involve signaling mechanisms shared by other hormone or neurotransmitter systems, but the vast discriminatory power of the olfactory system will require higher order neural processing to permit the perception of individual odors. This invention address the problem of olfactory perception at a molecular level. Eighteen different members of an extremely large multigene family have been cloned and characterized which encode seven transmembrane domain proteins whose expression is restricted to the olfactory epithelium. The members of this novel gene family encode the individual odorant receptors.

SUMMARY OF THE INVENTION

[0009] The invention provides an isolated nucleic acid, e.g. a DNA and cDNA molecule, encoding an odorant receptor. The invention further provides expression vectors containing such nucleic acid. Also provided by the invention is a purified odorant receptor protein encoded by the isolated nucleic acid. The invention further provides a method of transforming cells which comprises transfecting a suitable host cell with a suitable expression vector containing the nucleic acid encoding the odorant receptor.

[0010] The invention also provides methods of identifying odorant ligands and of identifying odorant receptors. The invention further provides methods of developing fragrances, of identifying appetite suppressant compounds, and of controlling appetite. The invention also provides methods of controlling animal populations. The invention additionally provides a method of detecting odors such as the vapors emanating from cocaine, marijuana, heroin, hashish, angel dust, gasoline, decayed human flesh, alcohol, gun powder explosives, plastic explosives, firearms, poisonous or harmful smoke, or natural gas.

DESCRIPTION OF THE FIGURES

[0011] FIGS. 1A-B. The Olfactory Neuroepithelium and a Pathway for Olfactory Signal Transduction.

[0012] (A). The Olfactory Neuroepithelium. The initial event in odor perception occurs in the nasal cavity in a specialized neuroepithelium which is diagramed here. Odors are believed to interact with specific receptors on the cilia of olfactory sensory neurons. The signal generated by these initial binding events are propagated by olfactory neuron axons to the olfactory bulb.

[0013] (B). A Pathway of Olfactory Signal Transduction. In this scheme, the binding of an odorant molecule to an odor-specific transmembrane receptor leads to the interaction of the receptor with a GTP-binding protein (G_(S[olf])) This interaction, in turn, leads to the release of the GTP-coupled a-subunit of the G-protein, which then stimulates adenylyl cyclase to produce elevated levels of cAMP. The increase in cAMP opens nucleotide-gated cation channels, thus causing an alteration in membrane potential.

[0014] FIGS. 2A-B. A PCR Amplification Product Containing Multiple Species of DNA. cDNA prepared from olfactory epithelium RNA was subjected to PCR amplification with a series of different primer oligonucleotides and the DNA products of appropriate size were isolated, further amplified by PCR, and size fractionated on agarose gels (A) (For details, see text). Each of these semipurified PCR products was digested with the restriction enzyme, Hinf I, and analyzed by agarose gel electrophoresis. Lanes marked “M” contain size markers of 23.1, 9.4, 5.6, 4.4, 2.3, 2.0, 1.35, 1.08, 0.87, 0.60, 0.31, 0.28, 0.23, 0.19, 0.12 and 0.07 kb. (B). Twenty-two of the 64 PCR products that were isolated and digested with Hinf I are shown here. Digestion of one of these, PCR 13, yielded a large number of fragments whose sizes summed to a value much greater than that of the undigested PCR 13 DNA, indicating that PCR 13 might contain multiple species of DNA which are representatives of a multigene family.

[0015]FIG. 3. Northern Blot Analysis with a Mixture of Twenty Probes. One μg of polyA+ RNA isolated from rat olfactory epithelium, brain, or spleen was size-fractionated in formaldehyde agarose, blotted onto a nylon membrane, and hybridized with a ³²P-labeled mixture of segments of 20 cDNA clones. The DNA segments were obtained by PCR using primers homologous to transmembrane domains 2 and 7.

[0016] FIGS. 4A-M. The Protein Sequences Encoded by Ten Divergent cDNA Clones. Ten divergent cDNA clones were subjected to DNA sequence analyses and the protein sequence encoded by each was determined (SEQ ID Nos: 71-80). Amino acid residues which are conserved in 60% or more of the proteins are shaded. The presence of seven hydrophobic domains (I-VII), as well as short conserved motifs shared with other members of the superfamily, demonstrate that these proteins belong to the seven transmembrane (TM) domain protein superfamily. The transmembrane regions are indicated by labeled lines (I-VII) above the sequences. Motifs conserved among members of the family of olfactory proteins include those indicated by underlining below the sequences. In addition, the DRY motif C-terminal to TM3 is common to many members of the G-protein-coupled superfamily. However, all of the proteins shown here share sequence motifs not found in other members of this superfamily and are clearly members of a novel family of proteins.

[0017]FIG. 5. Positions of Greatest Variability in the Olfactory Protein Family. In this diagram, the protein encoded by cDNA clone I15 is shown traversing the plasma membrane seven times with its N-terminus located extracellularly, and its C-terminus intracellularly. The vertical cylinders delineate the seven putative a-helices spanning the membrane. Positions at which 60% or more of the 10 clones shown in FIG. 4 share the same residue as I15 are shown as white balls. More variable residues are shown as black balls. The high degree of variability encountered in transmembrane domains III, IV, and V is evident in this schematic.

[0018] FIGS. 6A-D. The Presence of Subfamilies in a Divergent Multigene Family. Partial nucleotide sequences and deduced protein sequences were obtained for 18 different cDNA clones. Transmembrane domain V along with the flanking loop sequences, including the entire cytoplasmic loop between transmembrane domains V and VI, are shown here for each protein. Amino acid residues found in 60% or more of the clones in a given position are shaded (A). This region of the olfactory proteins (particularly transmembrane domain V) appears to be highly variable (see FIG. 4). These proteins, however, can be grouped into subfamilies (B,C,D) in which the individual subfamily members share considerable homology in this divergent region of the protein.

[0019]FIG. 7. Southern Blot Analyses with Non-crosshybridizing Fragments of Divergent cDNAs. Five μg of rat liver DNA was digested with Eco RI (A) or Hind III (B), electrophoresed in 0.75% agarose, blotted onto a nylon membrane, and hybridized to the ³²P-labeled probes indicated. The probes used were PCR-generated fragments of: 1, clone F9 (identical to F12 in FIG. 4); 2, F5; 3, F6; 4, I3; 5, I7; 6, I14; or 7, I15. The lane labeled “1-7” was hybridized to a mixture of the seven probes. The probes used showed either no crosshybridization or only trace crosshybridization with one another. The size markers on the left correspond to the four blots on the left (1-4) whereas the marker positions noted on the right correspond to the four blots on the right (5-7, “1-7”)

[0020]FIG. 8. Northern Blot Analysis with a Mix of Seven Divergent Clones. One μg of polyA+ RNA from each of the tissues shown was size-fractionated, blotted onto a nylon membrane, and hybridized with a ³²P-labeled mixture of segments of seven divergent cDNA clones (see Legend to FIG. 7).

[0021] FIGS. 9A-D. The nucleic acid and amino acid sequence of clone F3 (SEQ ID NO: 2 and SEQ ID NO: 71, respectively).

[0022] FIGS. 10A-D. The nucleic acid and amino acid sequence of clone F5 (SEQ ID NO: 3 and SEQ ID NO: 72, respectively).

[0023] FIGS. 11A-D. The nucleic acid and amino acid sequence of clone F6 (SEQ ID NO: 4 and SEQ ID NO: 73, respectively)

[0024] FIGS. 12A-D. The nucleic acid and amino acid sequence of clone F12 (SEQ ID NO: 1 and SEQ ID NO: 74, respectively).

[0025] FIGS. 13A-C. Partial nucleic acid and amino acid sequence of clone I3. Full nucleic acid and amino acid sequence of clone I3 are indicated in SEQ ID NO: 7 and SEQ ID NO: 75, respectively.

[0026] FIGS. 14A-D. The nucleic acid and amino acid sequence of clone I7 (SEQ ID NO: 8 and SEQ ID NO: 76, respectively).

[0027] FIGS. 15A-D. The nucleic acid and amino acid sequence of clone I8 (SEQ ID NO: 9 and SEQ ID NO: 77, respectively).

[0028] FIGS. 16A-D. The nucleic acid and amino acid sequence of clone I9 (SEQ ID NO: 10 and SEQ ID NO: 78, respectively).

[0029] FIGS. 17A-D. The nucleic acid and amino acid sequence of clone I14 (SEQ ID NO: 5 and SEQ ID NO: 79, respectively).

[0030] FIGS. 18A-D. The nucleic acid and amino acid sequence of clone I15 (SEQ ID NO: 6 and SEQ ID NO: 80, respectively).

[0031] FIGS. 19A-D. The nucleic acid and amino acid sequence of human clone H5 (SEQ ID NO: 11 and SEQ ID NO: 12, respectively).

[0032] FIGS. 20A-C. The nucleic acid and amino acid sequence of clone J1, where the reading frame starts at nucleotide position 2 (SEQ ID NO: 13 and SEQ ID NO: 14, respectively).

[0033] FIGS. 21A-B. The nucleic acid and amino acid sequence of clone J2 (SEQ ID NO: 15 and SEQ ID NO: 16, respectively).

[0034] FIGS. 22A-B. The nucleic acid and amino acid sequence of clone J4, where the reading frame starts at nucleotide position 2 (SEQ ID NO: 17 and SEQ ID NO: 18, respectively).

[0035] FIGS. 23A-B. The nucleic acid and amino acid sequence of clone J7, where the reading frame starts at nucleotide position 2 (SEQ ID NO: 19 and SEQ ID NO: 20, respectively).

[0036] FIGS. 24A-B. The nucleic acid and amino acid sequence of clone J8, where the reading frame starts at nucleotide position 2 (SEQ ID NO: 21 and SEQ ID NO: 22, respectively).

[0037] FIGS. 25A-C. The nucleic acid and amino acid sequence of clone J11 (SEQ ID NO: 23 and SEQ ID NO: 24, respectively).

[0038] FIGS. 26A-B. The nucleic acid and amino acid sequence of clone J14, where the reading frame starts at nucleotide position 2 (SEQ ID NO: 25 and SEQ ID NO: 26, respectively).

[0039] FIGS. 27A-B. The nucleic acid and amino acid sequence of clone J15, where the reading frame starts at nucleotide position 2 (SEQ ID NO: 27 and SEQ ID NO: 28, respectively).

[0040] FIGS. 28A-B. The nucleic acid and amino acid sequence of clone J16, where the reading frame starts at nucleotide position 2 (SEQ ID NO: 29 and SEQ ID NO: 30, respectively).

[0041] FIGS. 29A-B. The nucleic acid and amino acid sequence of clone J17, where the reading frame starts at nucleotide position 2 (SEQ ID NO: 31 and SEQ ID NO: 32, respectively).

[0042] FIGS. 30A-B. The nucleic acid and amino acid sequence of clone J19, where the reading frame starts at nucleotide position 2 (SEQ ID NO: 33 and SEQ ID NO: 34, respectively) The amino acid sequence after the stop codon is given in SEQ ID NO: 54.

[0043] FIGS. 31A-B. The nucleic acid and amino acid sequence of clone J20₇ where the reading frame starts at nucleotide position 2 (SEQ ID NO: 35 and SEQ ID NO: 36, respectively). FIG. 32. SOUTHERN BLOT: Five micrograms of DNA isolated from 1. Human placenta, 2. NCI-H-1011 neuroblastoma cells, or 3. CHP 134 neuroblastoma cells were treated with the restriction enzyme A. Eco RI, B. Hind III, C. Bam HI, or D. Pst I, and then electrophoresed on an agarose gel and blotted onto a nylon membrane. The blotted DNA was hybridized to the ³²P-labeled H3/H5 sequence. An autoradiograph of the hybridized blot is shown with the sizes of co-electrophoresed size markers noted in kilobases.

DETAILED DESCRIPTION OF THE INVENTION

[0044] Throughout this application, the following standard abbreviations are used to indicate specific amino acids: 3-character 1-character abbreviation Amino Acid abbreviation Ala Alanine A Arg Arginine R Asn Asparagine N Asp Aspartic Acid D Cys Cysteine C Gln Glutamine Q Glu Glutamic Acid E Gly Glycine G His Histidine H Ile Isoleucine I Leu Leucine L Lys Lysine K Met Methionine M Phe Phenylalanine F Pro Proline P Ser Serine S Thr Threonine T Trp Tryptophane W Tyr Tyrosine Y Val Valine V Asx Asparagine/ B Aspartic Acid Glx Glutamine/ Z Glutamic Acid *** (End) * Xxx Any amino acid or as X specified.

[0045] Having due regard to the preceding definitions, the invention provides an isolated nucleic acid molecule encoding an odorant receptor protein, wherein the receptor protein comprises seven transmembrane domains, and is further characterized by at least one of the following characteristics:

[0046] (a) the loop between the first transmembrane domain and the second transmembrane domain, and the second transmembrane domain together comprise consecutive amino acids having the following sequence:

[0047] -L, X, X, P, M, Y, X, F, L- (SEQ ID NO: 55);

[0048] (b) the third transmembrane domain, and the loop between the third transmembrane domain and the fourth transmembrane domain together comprise consecutive amino acids having one of the following sequences:

[0049] -M, X, Y, D, R, X, X, A, I, C- (SEQ ID NO: 57); or

[0050] -D, R, X, X, A, I, C- (SEQ ID NO: 59);

[0051] (c) the loop between the fifth transmembrane domain and the sixth transmembrane domain, and the sixth transmembrane domain together comprise consecutive amino acids having one of the following sequences:

[0052] -K or R, X, F, S, T, C, X, S, H- (SEQ ID NO: 61); or

[0053] -F, S, T, C, X, S, H- (SEQ ID NO: 63); or

[0054] (d) the seventh transmembrane domain and the C-terminal domain together comprise consecutive amino acids having one of the following sequences:

[0055] -P, X, X, N, P, X, I, Y, X, L, R, N- (SEQ ID NO: 65); or

[0056] -P, X, X, N, P, X, I, Y- (SEQ ID NO: 67); or

[0057] -N, P, X, I, Y, X, L, R, N- (SEQ ID NO: 69);

[0058] wherein X is any amino acid.

[0059] In one embodiment:

[0060] (a) the loop between the first transmembrane domain and the second transmembrane domain, and the second transmembrane domain together comprise consecutive amino acids having the following sequence:

[0061] -L, H or Q, K or M or T, PMY, F or L, FL- (SEQ ID NO: 56);

[0062] (b) the third transmembrane domain, and the loop between the third transmembrane domain and the fourth transmembrane domain together comprise consecutive amino acids having one of the following sequences:

[0063] -M, A or S, YDR, F or Y, L or V, AIC- (SEQ ID NO: 58); or

[0064] -DR, F or Y, L or V, AIC- (SEQ ID NO: 60);

[0065] (c) the loop between the fifth transmembrane domain and the sixth transmembrane domain, and the sixth transmembrane domain together comprise consecutive amino acids having one of the following sequences:

[0066] -K or R, A or I or S or V, FSTC, A or G or S, SH- (SEQ ID NO: 62); or

[0067] -FSTC, A or G or S, SH- (SEQ ID NO: 64); or

[0068] (d) the seventh transmembrane domain and the C-terminal domain together comprise consecutive amino acids having one of the following sequences:

[0069] -P, M or L or V, F or L or V, NP, F or I, IY, C or S or T, LRN-(SEQ ID NO: 66); or

[0070] -P, M or L or V, F or L or V, NP, F or I, IY- (SEQ ID NO: 68); or

[0071] -NP, F or I, IY, C or S or T, LRN- (SEQ ID NO: 70).

[0072] In one embodiment of the isolated nucleic acid molecule described herein, the receptor protein is characterized by at least two of the characteristics (a) through (d). In one embodiment, the receptor protein is characterized by at least three of the characteristics (a) through (d). In one embodiment, the receptor protein is characterized by all of the characteristics (a) through (d).

[0073] The invention provides an isolated nucleic acid molecule encoding an odorant receptor protein, wherein the nucleic acid molecule encodes a protein selected from the group consisting of:

[0074] (a) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with tyrosine at position 333 as set forth in row F3 of FIGS. 4A to 4M (SEQ ID NO: 71),

[0075] (b) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with glutamine at position 313 as set forth in row F5 of FIGS. 4A to 4L (SEQ ID NO: 72),

[0076] (c) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with lysine at position 311 as set forth in row F6 of FIGS. 4A to 4L (SEQ ID NO: 73),

[0077] (d) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with glycine at position 317 as set forth in row F12 of FIGS. 4A to 4L (SEQ ID NO: 74),

[0078] (e) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with leucine at position 310 as set forth in row I3 of FIGS. 4A to 4L (SEQ ID NO: 75),

[0079] (f) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with glycine at position 327 as set forth in row I7 of FIGS. 4A to 4L (SEQ ID NO: 76),

[0080] (g) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with tryptophan at position 312 as set forth in row I8 of FIGS. 4A to 4L (SEQ ID NO: 77),

[0081] (h) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with leucine at position 314 as set forth in row I9 of FIGS. 4A to 4L (SEQ ID NO: 78),

[0082] (i) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with leucine at position 312 as set forth in row I14 of FIGS. 4A to 4L (SEQ ID NO: 79),

[0083] (j) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with leucine at position 314 as set forth in row I15 of FIGS. 4A to 4L (SEQ ID NO: 80), and

[0084] (k) an odorant receptor protein that shares from 40-80% amino acid identity with any one of the proteins of (a)-(j), comprises seven transmembrane domains, and is further characterized by at least one of the following characteristics:

[0085] (i) the loop between the first transmembrane domain and the second transmembrane domain, and the second transmembrane domain together comprise consecutive amino acids having the following sequence: -L, X, X, P, M, Y, X, F, L- (SEQ ID NO: 55);

[0086] (ii) the third transmembrane domain, and the loop between the third transmembrane domain and the fourth transmembrane domain together comprise consecutive amino acids having one of the following sequences:

[0087] -M, X, Y, D, R, X, X, A, I, C- (SEQ ID NO: 57); or

[0088] -D, R, X, X, A, I, C- (SEQ ID NO: 59);

[0089] (iii) the loop between the fifth transmembrane domain and the sixth transmembrane domain, and the sixth transmembrane domain together comprise consecutive amino acids having one of the following sequences:

[0090] -K or R, X, F, S, T, C, X, S, H- (SEQ ID NO: 61); or

[0091] -F, S, T, C, X, S, H- (SEQ ID NO: 63); or

[0092] (iv) the seventh transmembrane domain and the C-terminal domain together comprise consecutive amino acids having one of the following sequences:

[0093] -P, X, X, N, P, X, I, Y, X, L, R, N- (SEQ ID NO: 65); or

[0094] -P, X, X, N, P, X, I, Y- (SEQ ID NO: 67); or

[0095] -N, P, X, I, Y, X, L, R, N- (SEQ ID NO: 69);

[0096] wherein X is any amino acid.

[0097] In one embodiment:

[0098] (i) the loop between the first transmembrane domain and the second transmembrane domain, and the second transmembrane domain together comprise consecutive amino acids having the following sequence:

[0099] -L, H or Q, K or M or T, PMY, F or L, FL- (SEQ ID NO: 56);

[0100] (ii) the third transmembrane domain, and the loop between the third transmembrane domain and the fourth transmembrane domain together comprise consecutive amino acids having one of the following sequences:

[0101] -M, A or S, YDR, F or Y, L or V, AIC- (SEQ ID NO: 58); or

[0102] -DR, F or Y, L or V, AIC- (SEQ ID NO: 60);

[0103] (iii) the loop between the fifth transmembrane domain and the sixth transmembrane domain, and the sixth transmembrane domain together comprise consecutive amino acids having one of the following sequences:

[0104] -K or R, A or I or S or V, FSTC, A or G or S, SH- (SEQ ID NO: 62); or

[0105] -FSTC, A or G or S, SH- (SEQ ID NO: 64); or

[0106] (iv) the seventh transmembrane domain and the C-terminal domain together comprise consecutive amino acids having one of the following sequences:

[0107] -P, M or L or V, F or L or V, NP, F or I, IY, C or S or T, LRN- (SEQ ID NO: 66); or

[0108] -P, M or L or V, F or L or V, NP, F or I, IY- (SEQ ID NO: 68); or

[0109] -NP, F or I, IY, C or S or T, LRN- (SEQ ID NO: 70).

[0110] The invention provides an isolated nucleic acid molecule encoding an odorant receptor protein, wherein the nucleic acid molecule comprises a nucleic acid sequence which can be amplified by polymerase chain reaction using:

[0111] (a) any one of 5′ primers Al (SEQ ID NO: 37), A2 (SEQ ID NO: 38), A3 (SEQ ID NO: 39), A4 (SEQ ID NO: 40), or A5 (SEQ ID NO: 41); and

[0112] (b) any one of 3′ primers Bl (SEQ ID NO: 42), B2 (SEQ ID NO: 43), B3 (SEQ ID NO: 44), B4 (SEQ ID NO: 45), B5 (SEQ ID NO: 46), or B6 (SEQ ID NO: 47).

[0113] The invention provides an isolated nucleic acid molecule encoding an odorant receptor protein, wherein the nucleic acid molecule comprises:

[0114] (a) a nucleic acid sequence given in any one of FIGS. 9 to 31 (SEQ ID Nos: 1-10, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, or 35); or

[0115] (b) a nucleic acid sequence degenerate to a sequence of (a) as a result of the genetic code.

[0116] In one embodiment, the odorant receptor protein encoded by any of the isolated nucleic acid molecules described herein comprises seven transmembrane domains. In one embodiment, the loop between the fifth and sixth transmembrane domains consists of 17 amino acids.

[0117] An odorant receptor is a receptor which binds an odorant ligand and includes but is not limited to pheromone receptors. An odorant ligand may include, but is not limited to, molecules which interact with the olfactory sensory neuron, molecules which interact with the olfactory cilia, pheromones, and molecules which interact with structures within the vomeronasal organ.

[0118] In one embodiment, the odorant receptor protein encoded by any of the isolated nucleic acid molecules described herein is a vertebrate odorant receptor. In one embodiment, the vertebrate odorant receptor is a fish odorant receptor or a mammalian odorant receptor. In one embodiment, the mammalian odorant receptor is a human odorant receptor, a rat odorant receptor, a mouse odorant receptor or a dog odorant receptor.

[0119] In one embodiment, the isolated nucleic acid molecule described herein is deoxyribonucleic acid (DNA). In one embodiment, the DNA is cDNA.

[0120] In an embodiment, a human odorant receptor CDNA sequence and the corresponding protein are isolated (SEQ ID Nos: 11 and 12) (FIG. 19)

[0121] In another embodiment, pheromone receptors are isolated and shown as clones J1 (SEQ ID NOs: 13 and 14), J2 (SEQ ID NOs: 15 and 16), J4 (SEQ ID NOs: 17 and 18), J7 (SEQ ID NOs: 19 and 20), J8 (SEQ ID NOs: 21 and 22), J11 (SEQ ID NOs: 23 and 24), J14 (SEQ ID NOs: 25 and 26), J15 (SEQ ID NOs: 27 and 28), J16 (SEQ ID NOs: 29 and 30), J17 (SEQ ID NOs: 31 and 32), J19 (SEQ ID NOs: 33 and 34) and J20 (SEQ ID NOs: 35 and 36)(FIGS. 20-31).

[0122] The invention provides a vector comprising any of the isolated nucleic acid molecules described herein. In one embodiment, the vector additionally comprises elements necessary for replication and expression in a suitable host. Such expression vectors are well known in the art. Suitable hosts are well known in the art and include without limitation bacterial hosts such as E. coli, animal hosts such as CHO cells, insect cells, yeast cells and the like.

[0123] The invention provides a purified odorant receptor protein encoded by any of the isolated nucleic acid molecules described herein. Such proteins may be prepared by expression of the aforementioned expression vectors in suitable host cells, and recovery and purification of the receptors using methods well known in the art. Examples of such proteins include those having the amino acid sequences shown in FIGS. 9-31.

[0124] The invention provides methods of transforming cells which comprises transfecting a suitable host cell with a suitable expression vector containing nucleic acid encoding the odorant receptor. Techniques for carrying out such transformations on cells are well known to those skilled in the art (41,42).

[0125] The invention provides a cell transfected with any of the vectors described herein. In one embodiment, the cell is an olfactory cell. In one embodiment, the cell is a non-olfactory cell. In one embodiment, prior to being transfected with the vector, the non-olfactory cell does not express an odorant receptor protein. One advantage of using such transformed non-olfactory cells is that the desired odorant receptor will be the only odorant receptor expressed on the cell's surface.

[0126] In order to obtain cell lines that express a single receptor type, standard procedures may be used to clone individual cDNAs or genes into expression vectors and then transfect the cloned sequences into mammalian cell lines. This approach has been used with sequences encoding some other members of the seven transmembrane domain superfamily including the 5-HT₁c serotonin receptor (43). The cited work illustrates how members of this superfamily transferred into cell lines may generate immortal cell lines that express high levels of the transfected receptor on the cell surface where it will bind ligand and that such abnormally expressed receptor molecules can transduce signals upon binding to ligand.

[0127] The invention provides a method of identifying a desired odorant ligand, which comprises contacting any of the non-olfactory cells described herein, which express on its cell surface a known odorant receptor, with a series of odorant ligands and determining which ligands bind to the known odorant receptor on the non-olfactory cell.

[0128] The invention provides a method of identifying a desired odorant receptor, which comprises contacting a series of any of the non-olfactory cells described herein with a known odorant ligand and determining which odorant receptor binds with the odorant ligand.

[0129] The invention provides a method of detecting an odor which comprises:

[0130] (a) identifying an odorant receptor which binds the desired odorant ligand identified by any of the methods described herein; and

[0131] (b) imbedding the receptor in a membrane such that when the odorant ligand binds with the receptor identified in (a) above, a detectable signal is produced.

[0132] In one embodiment of the described method, the desired odorant ligand is a pheromone. In different embodiments, the desired odorant ligand is the vapor emanating from cocaine, marijuana, heroin, hashish, angel dust, gasoline, natural gas, alcohol, decayed human flesh, gun powder, an explosive, a plastic explosive, or a firearm.

[0133] In different embodiments, the desired odorant ligand is a toxic fume, a noxious fume or a dangerous fume. In different embodiments, the membrane is a cell membrane, an olfactory cell membrane, or a synthetic membrane. In different embodiment of the methods described herein, the detectable signal is a color change, a phosphorescence, a radioactivity, a visual signal, or an auditory signal.

[0134] The invention provides a method of quantifying the amount of an odorant ligand present in a sample which comprises any of the methods described herein wherein the detectable signal is quantified.

[0135] The invention provides a method of developing fragrances, which comprises identifying a desired odorant receptor by any of the methods described herein, then contacting a non-olfactory cell, which has been transfected with an expression vector comprising an isolated nucleic acid molecule encoding the desired odorant receptor such that the receptor is expressed upon the surface of the non-olfactory cell, with a series of compounds to determine which compounds bind with the receptor.

[0136] The invention provides a method of identifying an odorant fingerprint, which comprises contacting a series of cells, which have been transformed such that each express a known odorant receptor encoded by any of the nucleic acid molecules described herein, with a desired sample containing one or more odorant ligand and determining the type and quantity of the odorant ligands present in the sample.

[0137] The invention provides a method of identifying a compound which inhibits an odorant receptor, which comprises contacting an odorant receptor encoded by any of the nucleic acid molecules described herein with a series of compounds and determining which compound inhibits interaction between the odorant receptor and an odorant ligand known to bind to the odorant receptor.

[0138] The invention provides a method for identifying an appetite suppressant compound, which comprises identifying a compound by any of the methods described herein wherein the odorant receptor is associated with the perception of food.

[0139] The invention provides a nasal spray for controlling appetite, which comprises a compound identified by the methods described herein in a suitable carrier.

[0140] The invention provides a method for controlling appetite in a subject, which comprises administering to the subject an amount of a compound identified by the methods described herein effective to control the subject's appetite. In one embodiment, the method comprises administering the compound to the subject's olfactory epithelium.

[0141] The invention provides a method of trapping odors, which comprises contacting a membrane comprising a plurality of a desired odorant receptor encoded by any of the nucleic acid molecules described herein with a sample comprising a desired odorant ligand such that the desired odorant ligand is absorbed by the binding of the odorant ligand to the odorant receptor.

[0142] The invention provides an odor trap, which comprises a membrane comprising a plurality of a desired odorant receptor encoded by any of the nucleic acid molecules described herein, such that a desired odorant ligand is absorbed by the binding of the odorant ligand to the odorant receptor.

[0143] The invention provides a method for controlling a pest population in an area, which comprises spraying the area with an odorant receptor ligand identified by the method described herein. In one embodiment, the odorant ligand is an alarm odorant ligand. In one embodiment, the odorant ligand interferes with an interaction between an odorant ligand and an odorant receptor associated with fertility. In different embodiments, the pest population is a population of rodents, mice, or rats.

[0144] The invention provides a method of promoting fertility in a subject which comprises administering to the subject an amount of an odorant ligand identified by the methods described herein effective to promote the subject's fertility. In one embodiment, the odorant ligand interacts with an odorant receptor associated with fertility. In one embodiment, the method comprises administering the odorant ligand to the subject's olfactory epithelium.

[0145] The invention provides a method of inhibiting fertility in a subject which comprises administering to the subject an amount of an odorant ligand identified by the methods described herein effective to inhibit the subject's fertility. In one embodiment, the odorant ligand inhibits an interaction between an odorant ligand and an odorant receptor associated with fertility. In one embodiment, the method comprises administering the odorant ligand to the subject's olfactory epithelium.

[0146] The invention provides the use of an odorant ligand identified by the methods described herein for the preparation of a pharmaceutical composition for controlling a pest population in a desired area by spraying the desired area with the identified odorant ligand. In one embodiment, the odorant ligand is an alarm odorant ligand. In different embodiments, the pest population is a population of rodents, mice, or rats.

[0147] The invention provides the use of an odorant ligand identified by the methods described herein for the preparation of a pharmaceutical composition for controlling a pest population. In one embodiment, the odorant ligand interferes with the interaction between odorant ligands and odorant receptors which are associated with fertility. In different embodiments, the pest population is a population of rodents, mice, or rats.

[0148] The invention provides the use of an odorant ligand identified by the methods described herein for the preparation of a pharmaceutical composition for promoting fertility. In one embodiment, the odorant ligand interacts with odorant receptors associated with fertility.

[0149] The invention provides the use of an odorant ligand identified by the methods described herein for the preparation of a pharmaceutical composition for inhibiting fertility. In one embodiment, the odorant ligand inhibits the interaction between odorant ligands and odorant receptors associated with fertility.

[0150] The invention provides the use of the compound identified by the methods described herein for the preparation of a pharmaceutical composition for controlling appetite in a subject.

[0151] The invention provides a pharmaceutical composition comprising any of the compounds or odorant ligands identified by any of the methods described herein and a pharmaceutically acceptable carrier. The phrase “pharmaceutically acceptable carrier” means any of the standard pharmaceutically acceptable carriers. Examples include, but are not limited to, phosphate buffered saline, physiological saline, water, and emulsions, such as oil/water emulsions. In one embodiment, the composition can be applied to a subject's olfactory epithelium.

[0152] This invention will be better understood from the Experimental Details which follow. However, one skilled in the art will readily appreciate that the specific methods and results discussed are merely illustrative of the invention as described more fully in the claims which follow thereafter.

[0153] Experimental Details

[0154] Materials And Methods

[0155] Polymerase Chain Reaction

[0156] RNA was prepared from the olfactory epithelia of Sprague Dawley rats according to Chirgwin et al. (40) or using RNAzol B (Cinna/Biotecx) and then treated with DNase I (0.1 unit/μg RNA) (Promega). In order to obtain cDNA, this RNA was incubated at 0.1 μg/yl with 5 μM random hexamers (Pharmacia), 1 mM each of DATP, dCTP, dGTP, TTP, and 2 units/μ1 RNase inhibitor (Promega) in 10 mM TrisCl (pH 8.3), 50 mM KCl, 2.5 mM MgCl₂, and 0.001% gelatin for 10 min. at 22° C., and then for a further 45 min. at 37° C. following the addition of 20 units/μl of Moloney murine leukemia virus reverse transcriptase (BRL). After heating at 95° C. for 3 min., cDNA prepared from 0.2 μg of RNA was used in each of a series of polymerase chain reactions (PCR) containing 10 mM TrisCl (pH 8.3), 50 mM KCl, 1.5 mM MgCl₂, 0.001% gelatin, 200 μM each of DATP, dCTP, dGTP, and TTP, 2.5 units Taq polymerase (Perkin Elmer Cetus), and 2 μM of each PCR primer. PCR reactions were performed according to the following schedule: 96° C. for 45 sec., 55° C. for 4 min. (or 45° C. for 2 min.), 72° C. for 3 min. with 6 sec. extension per cycle for 48 cycles. The primers used for PCR were a series of degenerate oligonucleotides made according to the amino acid sequences found in transmembrane domain 2 and 7 of a variety of different members of the 7 transmembrane domain protein superfamily (19). The regions used correspond to amino acids number 60-70 and 286-295 of clone I15 (FIG. 4). Each of five different 5′ primers were used in PCR reactions with each of six different 3′ primers. The 5′ primers had the sequences: A1, AA(T/C)T(G/A)(G/C)ATI(C/A)TI(G/C)TIAA(T/C)(C/T)TIG CIGTIGCIGA (SEQ ID NO: 37); A2, AA(T/C)TA(T/C)TT(T/C)(C/A)TI(G/A)TIAA(T/C)CTIGCI (T/C)TIGCIGA (SEQ ID NO: 38); A3, AA(T/C)(T/C)(T/A)ITT(T/C)(A/C)TIATI(T/A)CICTIGCIT (G/C)IGCIGA (SEQ ID NO: 39); A4, (C/A)GITTI(C/T)TIATGTG(T/C)AA(C/T)CTI(T/A)(G/C) (C/T)TT(T/C)GCIGA (SEQ ID NO: 40); and A5, ACIGTITA(T/C)ATIACICA(T/C)(C/T)TI(A/T)(C/G)IATIGCI GA (SEQ ID NO: 41) The 3′ primers were: B1, CTGI(C/T)(G/T)(G/A)TTCATIA(A/T)I(A/C)(C/A) (A/G)TAIA(T/C)IA(T/C)IGG(G/A)TT (SEQ ID NO: 42); B2, (G/T)(A/G)T(C/G)(G/A)TTIAG(A/G)CA(A/G)CA(A/G)TAIAT IATIGG(G/A)TT (SEQ ID NO: 43); B3, TCIAT(G/A)TT(A/G)AAIGTIGT(A/G)TAIATIATIGG(G/A)TT (SEQ ID NO: 44) B4, GC(C/T)TTIGT(A/G)AAIATIGC(A/G)TAIAG(G/A)AAIGG (G/A)TT (SEQ ID NO: 45) B5, AA(A/G)TCIGG(G/A)(C/G)(T/A)ICGI(C/G)A(A/G)TAIAT (C/G)AIIGG(G/A)TT (SEQ ID NO: 46); and BG,(G/C)(A/T)I(G/C)(A/T)ICCIAC(A/G)AA(A/G)AA(A/G) TAIAT(A/G)AAIGG(G/A)TT (SEQ ID NO: 47).

[0157] In the preceding formulae, each parenthesis encloses amino acids which are alternatives to one other, and each slash within such parentheses separates such alternative amino acids.

[0158] An aliquot of each PCR reaction was analyzed by agarose gel electrophoresis and bands of interest were amplified further by performing PCR reactions on pipette tip (approx. 1 μl) plugs of the agarose gels containing those DNAs. Aliquots of these semi-purified PCR products were digested with the restriction enzymes Hae III or Hinf I and the digestion products were compared with the undigested DNAs on agarose gels.

[0159] Isolation and Analysis of cDNA Clones

[0160] CDNA libraries were prepared according to standard procedures (41, 42) in the cloning vector, λZAP II (Stratagene) using poly A⁺ RNA prepared from Sprague Dawley rat epithelia (see above) or from an enriched population of olfactory neurons which had been obtained by a ‘panning’ procedure, using an antibody against the H blood group antigen (Chembiomed) found on a large percentage of rat olfactory neurons. In initial library screens, 8.5×10⁵ independent clones from the olfactory neuron library and 1.8×10⁶ clones from the olfactory epithelium library were screened (41) with a ³²P-labeled probe (prime-it, Stratagene) consisting of a pool of gel-isolated PCR products obtained using primers A4 and B6 (see above) in PCR reactions using as template, olfactory epithelium cDNA, rat liver DNA, or DNA prepared from the two cDNA libraries. In later library screens, a mixture of PCR products obtained from 20 cDNA clones with the A4 and B6 primers was used as probe (‘P1’ probe). In initial screens, phage clones were analyzed by PCR using primers A4 and B6 and those which showed the appropriate size species were purified. In later screens, all position clones were purified, but only those that could be amplified with the B6 primer and a primer specific for vector sequence were analyzed further. To obtain plasmids from the isolated phage clones, phagemid rescue was performed according to the instructions of the manufacturer of λZAP II (Stratagene). DNA sequence analysis was performed on plasmid DNAs using the Sequenase system (USB), initially with the A4 and B6 primers and later with oligonucleotide primers made according to sequences already obtained.

[0161] Northern and Southern Blot Analyses

[0162] For Northern blots, poly A⁺ RNAs from various tissues were prepared as described above or purchased from Clontech. One μg of each RNA was size fractionated on formaldehyde agarose gels and blotted onto nylon membranes (41, 42). For Southern blots, genomic DNA prepared from Sprague Dawley rat liver was digested with the restriction enzymes Eco RI or Hind III, size fractionated on agarose gels and blotted onto nylon membranes (41, 42). The membranes were dried at 80° C., and then prehybridized in 0.5 M sodium phosphate buffer (pH 7.3) containing 1% bovine serum albumin and 4% sodium dodecyl sulfate. Hybridization was carried out in the same buffer at 65°-70° C. for 14-20 hrs. with DNAs labeled with ³²P. For the first Northern blot shown, the ‘P1’ probe (see above under cDNA clone isolation) was used. For the second Northern blot shown, a mix of PCR fragments from seven divergent cDNA clones was used. For Southern blots, the region indicated in clone I15 by amino acids 118 through 251 was amplified from a series of divergent cDNA clones using PCR. The primers used for these reactions had the sequences: P1, ATGGCITA(T/C)GA(T/C)(C/A)GITA(T/C)GTIGC (SEQ ID NO: 48), and P4, AAIA(G/A)I(G/C)(A/T)IACIA(T/C)I(G/C)(A/T)IA (G/A)(A/G)TGI(G/C)(A/T)I(C/G)C (SEQ ID NO: 49).

[0163] In the preceding formulae, each parenthesis encloses amino acids which are alternatives to one other, and each slash within such parentheses separates such alternative amino acids.

[0164] These DNAs (or a DNA encompassing transmembrane domains 2 through 7 for clone F6) were labeled and tested for crosshybridization at 70° C. Those DNAs which did not show appreciable crosshybridization were hybridized individually, or as a pool to Southern blots at 70° C.

[0165] Rat Sequences Used to Obtain Similar Sequences Expressed in Humans

[0166] There are genes similar to the rat genes discussed above present in humans, these genes may be readily isolated by screening human gene libraries with the cloned rat sequences or by performing PCR experiments on human genomic DNA with primers homologous to the rat sequences. First, PCR experiments were performed with genomic DNA from rat, human, mouse, and several other species. When primers homologous to transmembrane domains 2 and 6 (the A4/B6 primer set used to isolate the original rat sequences) were used, DNA of the appropriate size was amplified from rat, human and mouse DNAs. When these primary PCR reactions were subsequently diluted and subjected to PCR using primers to internal sequences (P1 and P4 primers), smaller DNA species were amplified whose size was that seen when the same primers were used in PCR reactions with the cloned rat cDNAs. Similarly, when the secondary PCR was performed with one outer primer together with one inner primer (i.e. A4/P4 or P1/B6), amplified DNAs were obtained whose sizes were also consistent with the amplification of genes similar in sequence and organization to the cloned rat cDNAs. Second, a mix of segments from 20 of the rat cDNAs (‘Pi1” probe) was used to screen libraries constructed from human genomic DNAs. Hybridization under high or low stringency conditions reveals the presence of a large number of cloned human DNA segments that are homologous to the rat sequences. Finally, RNA from a human olfactory tumor (neuroesthesioma, NCI-H-1011) cell line has been examined for sequences homologous to those cloned in the rat. cDNA prepared from this RNA was subjected to PCR with the A4/B6 primer set and a DNA species of the appropriate size was seen. This DNA was subcloned and partially sequenced and clearly encodes a member of the olfactory protein family identified in the rat.

[0167] The inserted sequence in human clones H3/H5 was amplified by PCR with the A4/B6 primers, gel purified, and then labeled with ³²P. The labeled DNA was then hybridized to restriction enzyme human placenta. Multiple hybridizing species were observed with each DNA (see FIG. 32). This observation is consistent with the presence of a family of odorant receptor genes in the human genome.

[0168] The sequence of clone H5 is hereby shown in FIG. 19. In addition, the translated protein sequence is shown in FIG. 19.

[0169] In other to identify odorant receptors in other species, degenerated primer oligonucleotides homologous to conserved regions within the rat odorant receptor family may be used in PCR reactions with genomic DNA or with cDNA prepared from olfactory tissue RNA from those species.

[0170] Results

[0171] Cloning the Gene Family

[0172] A series of degenerate oligonucleotides were designated which could anneal to conserved regions of members of the superfamily of G-protein coupled seven transmembrane domain receptor genes. Five degenerate oligonucleotides (A1-5; see Experimental Procedures) matching sequences within transmembrane domain 2, and six degenerate oligonucleotides (B1-6) matching transmembrane domain 7 were used in all combinations in PCR reactions to amplify homologous sequences in cDNA prepared from rat olfactory epithelium RNA. The amplification products of each PCR reaction were then analyzed by agarose gel electrophoresis. Multiple bands were observed with each of the primer combinations. The PCR products within the size range expected for this family of receptors (600 to 1300 bp) were subsequently picked and amplified further with the appropriate primer pair in order to isolate individual PCR bands. Sixty-four PCR bands isolated in this fashion revealed only one or a small number of bands upon agarose gel electrophoresis. Representatives of these isolated PCR products are shown in FIG. 2A.

[0173] The isolated PCR products were digested with the endonuclease, Hae III or Hinf I, which recognize four base restriction sites and cut DNA at frequent intervals. In most instances, digestion of the PCR product with Hinf I generated a set of fragments whose molecular weights sum to the size of the original DNA (FIG. 2B). These PCR bands are therefore likely to each contain a single DNA species. In some cases, however, restriction digestion yielded a series of fragments whose molecular weights sum to a value greater than that of the original PCR product. The most dramatic example is shown in FIG. 2 where the 710 bp, PCR 13 DNA, is cleaved by Hinf I to yield a very large number of restriction fragments whose sizes sum to a value five- to ten-fold greater than that of the original PCR product. These observations indicated that PCR product 13 consists of a number of different species of DNA, each of which could be amplified with the same pair of primer oligonucleotides. In addition, when PCR experiments similar to those described were performed using cDNA library DNAs as templates, a 710 bp PCR product was obtained with the PCR13 primer pair (A4/B6) with DNA from olfactory cDNA libraries, but not a glioma cDNA library. Moreover, digestion of one of this 710 bp product also revealed the presence of multiple DNA species. In other cases (see PCR product 20, for example), digestion yielded a series of restriction fragments whose molecular weights also sum to a size greater than the starting material. Further analysis, however, revealed that the original PCR product consisted of multiple bands of similar but different sizes.

[0174] In order to determine whether the multiple DNA species present in PCR 13 encode members of a family of seven transmembrane domain proteins, PCR 13 DNA was cloned into the plasmid vector Bluescript and five individual clones were subjected to DNA sequence analysis. Each of the five clones exhibited a different DNA sequence, but each encoded a protein which displayed conserved features of the superfamily of seven transmembrane domain receptor proteins. In addition, the proteins encoded by all five clones shared distinctive sequence motifs not found in other superfamily members indicating they were all members of a new family of receptors.

[0175] To obtain full-length cDNA clones, cDNA libraries prepared from olfactory epithelium RNA or from RNA of an enriched population of olfactory sensory neurons were screened. The probe used in these initial screens was a mixture of PCR 13 DNA as well as DNA obtained by amplification of rat genomic DNA or DNA from two olfactory cDNA libraries with the same primers used to generate PCR 13 (A4 and B6 primers). Hybridizing plaques were subjected to PCR amplification with the A4/B6 primer set and only those giving a PCR product of the appropriate size (approximately 710 bp) were purified. The frequency of such positive clones in the enriched olfactory neuron cDNA library was approximately five times greater than the frequency in the olfactory epithelium cDNA library. The increased frequency of positive clones observed in the olfactory neuron library is comparable to the enrichment in olfactory neurons generally obtained in the purification procedure.

[0176] The original pair of primers used to amplify PCR 13 DNA were then used to amplify coding segments of 20 different cDNA clones. A mix of these PCR products were labeled and used as probe for further cDNA library screens. This mixed probe was also used in a Northern blot (FIG. 3) to determine whether the expression of the gene family is restricted to the olfactory epithelium. The mixed probe detects two diffuse bands centered at 2 and 5 kb in RNA from olfactory epithelium; no hybridization can be detected in brain or spleen. (Later experiments which examined a larger number of tissue RNAs with a more restricted probe will be shown below.) Taken together, these data indicate the discovery of a novel multigene family encoding seven transmembrane domain proteins which are expressed in olfactory epithelium, and could be expressed predominantly or exclusively in olfactory neurons.

[0177] The Protein Sequences of Numerous, Olfactory-specific Members of the Seven Transmembrane Domain Superfamily

[0178] Numerous clones were obtained upon screening cDNA libraries constructed from olfactory epithelium and olfactory neuron RNA at high stringency. Partial DNA sequences were obtained from 36 clones; 18 of these cDNA clones are different, but all of them encode proteins which exhibit shared sequence motifs indicating that they are members of the family identified in PCR 13 DNA. A complete nucleotide sequence was determined for coding regions of ten of the most divergent clones (FIG. 4). The deduced protein sequences of these cDNAs defines a new multigene family which shares sequence and structural properties with the superfamily of neurotransmitter and hormone receptors that traverse the membrane seven times. This novel family, however, exhibits features different from any other member of the receptor superfamily thus far identified.

[0179] Each of the ten sequences contains seven hydrophobic stretches (19-26 amino acids) that represent potential transmembrane domains. These domains constitute the regions of maximal sequence similarity to other members of the seven transmembrane domain superfamily (see legend to FIG. 4). On the basis of structural homologies with rhodopsin and the β-adrenergic receptors, (19) it is likely that the amino termini of the olfactory proteins are located on the extracellular side of the plasma membrane and the carboxyl termini are located in the cytoplasm. In this scheme, three extracellular loops alternate with three intracellular loops to link the seven transmembrane domains (see FIG. 5). Analysis of the sequences in FIG. 4 demonstrates that the olfactory proteins, like other members of the receptor superfamily, display no evidence of an N-terminal signal sequence. As in several other superfamily members, a potential N-linked glycosylation site is present in all ten proteins within the short N-terminal extracellular segment. Other structural features conserved with previously identified members of the superfamily included cysteine residues at fixed positions within the first and second extracellular loops that are thought to form a disulfide bond. Finally, many of the olfactory proteins reveal a conserved cysteine within the C-terminal domain which may serve as a palmitoylation site anchoring this domain to the membrane (21). These features, taken together with several short, conserved sequence motifs (see legend to FIG. 4), clearly define this new family as a member of the superfamily of genes encoding the seven transmembrane domain receptors.

[0180] There are, however, important differences between the olfactory protein family and the other seven transmembrane domain proteins described previously and these differences may be relevant to proposed function of these proteins in odor recognition. Structure-function experiments involving in vitro mutagenesis suggest that adrenergic ligands interact with this class of receptor molecule by binding within the plane of the membrane (22, 20). Not surprisingly, small receptor families that bind the same class of ligands, such as the adrenergic and muscarinic acetylcholine receptor families exhibit maximum sequence conservation (often over 80%) within the transmembrane domains. In contrast, the family of receptors discussed in this application shows striking divergence within the third, fourth, and fifth transmembrane domains (FIG. 4). The variability in the three central transmembrane domains is highlighted schematically in FIG. 5. The divergence in potential ligand binding domains is consistent with the idea that the family of molecules cloned is capable of associating with a large number of odorant of diverse molecular structure.

[0181] Receptors which belong to the superfamily of seven transmembrane domain proteins interact with G-proteins to generate intracellular signals. In vitro mutagenesis experiments indicate that one site of association between receptor and G-protein resides within the third cytoplasmic loop (22, 23). The sequence of this cytoplasmic loop in 18 different clones we have characterized is shown in FIG. 6A. This loop which is often quite long and of variable length in the receptor superfamily is relatively short (only 17 amino acids) and of fixed length in the 18 clones examined. Eleven of the 18 different clones exhibit the sequence motif K/R I V S S I (SEQ ID NO: 50 and SEQ ID NO: 51) (or a close relative) at the N-terminus of this loop. Two of the cDNA clones reveal a different H I T C/W A V (SEQ ID NO: 52 and SEQ ID NO: 53) motif at this site. If this short loop is a site of contact with G-proteins, it is possible that the conserved motifs may reflect sites of interaction with different G-proteins that activate different intracellular signaling systems in response to odors. In addition, the receptors cloned reveal several serine or threonine residues within the third cytoplasmic loop. By analogy with other G-protein coupled receptors, these residues may represent sites of phosphorylation for specific receptor kinases involved in desensitization (24).

[0182] Subfamilies within the Multigene Family

[0183]FIG. 6A displays the sequences of the fifth transmembrane domain and the adjacent cytoplasmic loop encoded by L8 of the cDNA clones we have analyzed. As a group, the 18 sequences exhibit considerable divergence within this region. The multigene family, however, can be divided into subfamilies such that the members of a given subfamily share significant sequence conservation.

[0184] In subfamily B, clones F12 and F13, for example, differ from one another at only four of 44 positions (91% identify), and clearly define a subfamily. Clones F5 and Ill (subfamily D) differ from F12 and F13 at 34-36 positions within this region and clearly define a separate subfamily. Thus, this olfactory-specific multigene family consists of highly divergent subfamilies. If these genes encode odor receptors, it is possible that members of the divergent subfamilies bind odorant of widely differing structural classes. Members of the individual subfamilies could therefore recognize more subtle differences between molecules which belong to the same structural class of molecules structures.

[0185] The Size of the Multigene Family

[0186] Genomic Southern blotting experiments were preformed and genomic libraries were screened to obtain an estimate of the sizes of the multigene family and the member subfamilies encoding the putative odor receptors. DNAs extending from the 3′ end of transmembrane domain 3 to the middle of transmembrane domain 6 were synthesized by PCR from DNA of seven of the divergent cDNA clones (FIG. 4). In initial experiments, these DNAs were labeled and hybridized to each other to define conditions under which minimal crosshybridization would be observed among the individual clones. At 70° C., the seven DNAs showed no crosshybridization, or crossnybridized only very slightly. The trace levels of crosshybridization observed are not likely to be apparent upon genomic Southern blot analysis where the amounts of DNA are far lower than in the test cross.

[0187] Probes derived from these seven DNAs were annealed under stringent conditions, either individually or as a group, to Southern blots of rat liver DNA digested with the restriction endonucleases Eco RI or Hind III (FIG. 7).

[0188] Examination of the Southern blots reveals that all but one of the cDNAs detects a relatively large, distinctive array of bands in genomic DNA. Clone I15 (probe 7), for example, detects about 17 bands with each restriction endonuclease, whereas clone F9 (probe 1) detects only about 5-7 bands with each enzyme. A single band is obtained with clone I7 (probe 5). PCR experiments using nested primers (TM2/TM7 primers followed by primers to internal sequences) and genomic DNA as template indicate that the coding regions of the members of this multigene family, like those of many members of the G-protein coupled superfamily, may not be interrupted by introns. This observation, together with the fact that most of the probes only encompasses 400 nucleotides suggests that each band observed in these experiments is likely to represent a different gene. These data suggest that the individual probes chosen are representatives of subfamilies which range in size from a single member to as many as 17 members. A total of about 70 individual bands were detected in this analysis which could represent the presence of at least 70 different genes. Although the DNA probes used in these blots did not crosshybridize appreciably with each other, it is possible that a given gene might hybridize to more than one probe, resulting in an overestimate of gene number. However, it is probable that the total number of bands only reflects a minimal estimate of gene number since it is unlikely that we have isolated representative cDNAs from all of the potential subfamilies and the hybridizations were performed under conditions of very high stringency.

[0189] A more accurate estimate of the size of the olfactory-specific gene family was obtained by screening rat genomic libraries. The mix of the seven divergent probes used in Southern blots, or the mix of 20 different probes used in our initial Northern blots (see FIG. 3), were used as hybridization probes under high (65° C.) or lowered (55° C.) stringency conditions in these experiments. Nested PCR (see above) was used to verify that the clones giving a positive signal under low stringency annealing conditions were indeed members of this gene family. It is estimated from these studies that there are between 100 and 200 positive clones per haploid genome. The estimate of the size of the family obtain from screens of genomic libraries again represents a lower limit. Given the size of the multigene family, one might anticipate that many of these genes are linked such that a given genomic clone may contain multiple genes. Thus the data from Southern blotting and screens of genomic libraries indicate that the multigene family identified consists of one to several hundred member genes which can be divided into multiple subfamilies.

[0190] It should be noted that the cDNA probes isolated may not be representative of the full complement of subfamilies within the larger family of olfactory proteins. The isolation of cDNAs, for example, relies heavily on PCR with primers from transmembrane domains 2 and 7 and biases our clones for homology within these regions. Thus, estimates of gene number as well as subsequent estimates of RNA abundance should be considered as minimal.

[0191] Expression of the Members of this Multigene Family

[0192] Additional Northern blot analyses were preformed to demonstrate that expression of the members of this gene family is restricted to the olfactory epithelium. (FIG. 8) Northern blot analysis with a mixed probe consisting of the seven divergent cDNAs used above reveals two diffuse bands about 5 and 2 kb in length in olfactory epithelium RNA. This pattern is the same as that seen previously with the mix of 20 DNAs. No annealing is observed to RNA from the brain or retina or other, nonneural tissues, including lung, liver, spleen, and kidney.

[0193] An estimate of the level of expression of this family can be obtained from screens of cDNA libraries. The frequency of positive clones in cDNA libraries made from olfactory epithelium RNA suggests that the abundance of the RNAs in the epithelium is about one in 20,000. The frequency of positive clones is approximately five-fold higher in a cDNA library prepared from RNA from purified olfactory neurons (in which 75% of the cells are olfactory neurons). The increased frequency of positive clones obtained in the olfactory neuron cDNA library is comparable to the enrichment we obtain upon purification of olfactory neurons. These observations suggest that this multigene family is expressed largely, if not solely, in olfactory neurons and may not be expressed in other cell types within the epithelium. If each olfactory neuron contains 10⁵ mRNA molecules, from the frequency of positive clones we predict that each neuron contains only 25-30 transcripts derived from this gene family. Since the family of olfactory proteins consists of a minimum of a hundred genes, a given olfactory neuron could maximally express only a proportion of the many different family members. These values thus suggest that olfactory neurons will exhibit significant diversity at the level of expression of these olfactory proteins.

[0194] Identification of Pheromone Receptors in Vomeronasal Organ

[0195] The vomeronasal organ (vomeronasal gland) is an accessory olfactory structure that is located near the nasal cavity. Like the olfactory epithelium of the vomeronasal organ contains olfactory sensory neurons. The vomeronasal organ is believed to play an important role in the sensing of pheromones in numerous species. Pheromones are believed to have profound effects on both physiological and behavioral aspects of reproduction. The identification of pheromone receptors would permit the identification of the pheromones themselves. It would also enable one to identify agonists or antagonists that would either mimic the pheromones or block the pheromone receptors from transducing pheromone signals. Such information would be important to the development of species specific pesticides and, conversely, to animal husbandry. The identification of pheromone receptors in human could ultimately lead to the development of contraceptives or to treatments for infertility in humans. It is likely that the identification of pheromone receptors in low mammals such as rodents would lead to the identification of similar receptors in human.

[0196] In order to identify potential pheromone receptors, we isolate RNA from the vomeronasal organs of female rats and prepared cDNA from this RNA. The cDNA was subjected to PCR with several different pairs of degenerate oligonucleotide primers that match sequences present in the rat odorant receptor family. The PCR products were subcloned and the nucleotide sequences of the subcloned DNAs were determined. Each of the subcloned DNAs encodes a protein that belongs to the odorant receptor family. The sequences of the following vomeronasal subclones are shown: J1 (SEQ ID NOs: 13 and 14), J2 (SEQ ID NOs: 15 and 16), J4 (SEQ ID NOs: 17 and 18), J7 (SEQ ID NOs: 19 and 20), J8 (SEQ ID NOs: 21 and 22), J11 (SEQ ID NOs: 23 and 24), J14 (SEQ ID NOs: 25 and 26), J15 (SEQ ID NOs: 27 and 28), J16 (SEQ ID NOs: 29 and 30), J17 (SEQ ID NOs: 31 and 32), J19 (SEQ ID NOs: 33 and 34), and J20 (SEQ ID NOs: 35 and 36). In a few cases (J2, J4), the same sequence was amplified with two different primer pairs and the sequence shown is a composite of the two sequences. It is possible that one or more of these molecules, or closely related molecules, serve as pheromone receptors in the rat.

[0197] Discussion

[0198] The mammalian olfactory system can recognize and discriminate a large number of odorous molecules. Perception in this system, as in other sensory systems, initially involves the recognition of external stimuli by primary sensory neurons. This sensory information is then transmitted to the brain where it is decoded to permit the discrimination of different odors. Elucidation of the logic underlying olfactory perception is likely to require the identification of the specific odorant receptors, the analysis of the extent of receptor diversity and receptor specificity, as well as an understanding of the pattern of receptor expression in the olfactory epithelium.

[0199] The odorant receptors are thought to transduce intracellular signals by interacting with G-proteins which activate second messenger systems (12, 13, 14, 15). These proteins are clearly members of the family of G-protein coupled receptors which traverse the membrane seven times (19). The odorant receptors should be expressed specifically in the tissue in which odorant are recognized. The family of olfactory proteins cloned is expressed in the olfactory epithelium. Hybridizing RNA is not detected in brain or retina, or in a host of nonneural tissues. Moreover, expression of this gene family the epithelium may be restricted to olfactory neurons. The family of odorant receptors must be capable of interacting with extremely diverse molecular structures. The genes cloned are members of any extremely large multigene family which exhibit variability in regions thought to be important in ligand binding. The possibility that each member of this large family of seven transmembrane proteins is capable of interacting with only one or a small number of odorant provides a plausible mechanism to accommodate the diversity of odor perception. The properties of the gene family identified suggests that this family is likely to encode a large number of distinct odorant receptors.

[0200] Size of the Multigene Family

[0201] The size of the receptor repertoire is likely to reflect the range of detectable odors and the degree of structural specificity exhibited by the individual receptors. It has been estimated that humans can identify over 10,000 structurally-distinct odorous ligands. However, this does not necessarily imply that humans possess an equally large repertoire of odorant receptors. For example, binding studies in lower vertebrates suggest that structurally-related odorant may activate the same receptor molecules. In fish which smell amino acids, the binding of alanine to isolated cilia can be competed by other small polar residues (threonine and serine), but not by the basic amino acids, lysine or arginine (11). These data suggest that individual receptors are capable of associating with several structurally-related ligands, albeit with different affinities. Stereochemical models of olfactory recognition in mammals (25) (based largely on psychophysical, rather than biophysical data) have suggested existence of several primary odor groups including camphoraceous, musky, peppermint, ethereal, pungent, and putrid. In such a model, each group would contain odorant with common molecular configurations which bind to common receptors and share similar odor qualities.

[0202] Screens of genomic libraries with mixed probes consisting of divergent family members detect approximately 100 to 200 positive clones per genome. The present estimate of at least 100 genes provides only a lower limit since it is likely that the probes used do not detect all of the possible subfamilies. Moreover, it is probable that many of these genes are linked such that a given genomic clone may contain multiple genes. It is therefore expected that the actual size of the gene family may be considerably higher and this family of putative odorant receptors could constitute one of the largest gene families in the genome.

[0203] The characterization of a large multigene family encoding putative odorant receptors suggests that the olfactory system utilizes a far greater number of receptors than the visual system. Color vision, for example, allows the discrimination of several hundred hues, but is accomplished by only three different photoreceptors (1, 2, 3 and 4). The photoreceptors each have different, but overlapping absorption spectra which cover the entire spectrum of visible wavelengths. Discrimination of color results from comparative processing of the information from these three classes of photoreceptors in the brain. Whereas three photoreceptors can absorb light across the entire visible spectrum, our data suggest that a small number of odorant receptors cannot recognize and discriminate the full spectrum of distinct molecular structures perceived by the mammalian olfactory system. Rather, olfactory perception probably employs an extremely large number of receptors each capable of recognizing a small number of odorous ligands.

[0204] Diversity within the Gene Family and the Specificity of Odor Recognition

[0205] The olfactory proteins identified in this application are clearly members of the superfamily of receptors which traverse the membrane seven times. Analysis of the proteins encoded by the 18 distinct cDNAs we have cloned reveals structural features which may render this family particularly well suited for the detection of a diverse array of structurally distinct odorant. Experiments with other members of this class of receptors suggest that the ligand binds to its receptor within the plane of the membrane such that the ligand contacts many, if not all of the transmembrane helices. The family of olfactory proteins can be divided into several different subfamilies which exhibit significant sequence divergence within the transmembrane domains. Nonconservative changes are commonly observed within blocks of residues in transmembrane regions 3, 4, and 5 (FIGS. 4, 5, 6); these blocks could reflect the sites of direct contact with odorous ligands. Some members, for example, have acidic residues in transmembrane domain 3, which in other families are thought to be essential for binding aminergic ligands (20) while other members maintain hydrophobic residues at these positions. This divergence within transmembrane domains may reflect the fact that the members of the family of odorant receptors must associate with odorant of widely different molecular structures.

[0206] These observations suggest a model in which each of the individual subfamilies encode receptors which bind distinct structural classes of odorant. Within a given subfamily, however, the sequence differences are far less dramatic and are often restricted to a small number of residues. Thus, the members of a subfamily may recognize more subtle variations among odor molecules of a given structural class. At a practical level, individual subfamilies may recognize grossly different structures such that one subfamily may associate, for example, with the aromatic compound, benzene and its derivatives, whereas a second subfamily may recognize odorous, short chain, aliphatic molecules. Subtle variations in the structure of the receptors within, for example, the hypothetical benzene subfamily could facilitate the recognition and discrimination of various substituted derivatives such as toluene, xylene or phenol. It should be noted that such a model, unlike previous stereochemical models, does not necessarily predict that molecules with similar structures will have similar odors. The activation of distinct receptors with similar structures could elicit different odors, since perceived odor will depend upon higher order processing of primary sensory information.

[0207] Identification of Odorant Ligands for Members of the Gene Family

[0208] Odorant ligands have been identified for members of the gene family disclosed herein. For the I7 odorant receptor, these include octanal (44, 47), heptanal (45), trans, trans-2,4,-octadienal, tetrahydrocitral, and citronella (44). In addition, a variety of different aliphatic odorants have been described for 14 different odorant receptors (46). One odorant receptor can recognize multiple odorants and one odorant is recognized by multiple odorant receptors, but different odorants are recognized by different combinations of odorant receptors (46).

[0209] Evolution of the Gene Family and the Generation of Diversity

[0210] Preliminary evidence from PCR analyses suggests that members of this family of olfactory proteins are conserved in lower vertebrates as well as invertebrates. This gene family presumably expanded over evolutionary time providing mammals with the ability to recognize an increasing diversity of odorant. Examination of the sequences of the family members cloned from mammals provides some insight into the evolution of this multigene family. Although the chromosomal loci encoding these genes has yet to be characterized, it is likely that at least some member genes will be tandemly arranged in a large cluster as is observed with other large multigene families. A tandem array of this sort provides a template for recombination events including unequal crossing over and gene conversion, that can lead to expansion and further diversification of the sort apparent among the family members we have cloned (26).

[0211] The multigene family encoding the olfactory proteins is large: all of the member genes clearly have a common ancestral origin, but have undergone considerable divergence such that individual genes encode proteins that share from 40-80% amino acid identity. Subfamilies are apparent with groups of genes sharing greater homology among themselves than with members of other subfamilies. Examination of the sequences of even the most divergent subfamilies, however, reveals a pattern in which several blocks of conserved residues are interspersed with variable regions. This segmental homology is conceptually similar to the organization of framework and hypervariable domains within the families of immunoglobulin and T cell receptor variable region sequences (27, 28). This analogy goes beyond structural organization and may extend to the function of these two gene families: each family consists of a large number of genes which have diversified over evolutionary time to accommodate the binding of a highly diverse array of ligands. The evolutionary mechanisms responsible for the diversification and maintenance of these large gene families may also be similar. It has been suggested that gene conversion has played a major role in the evolution of immunoglobulin and T cell receptor variable domains (29, 30 and 31). Analysis of the sequence of the putative olfactory receptors reveals at least one instance where a motif from a variable region of one subfamily is found imbedded in the otherwise divergent sequence of a second subfamily, suggesting that conversion has occurred. Such a mixing of motifs from one subfamily to another over evolutionary time would provide additional combinatorial possibilities leading to the generation of diversity.

[0212] It should be noted, however, that the combinatorial joining of gene segments by DNA rearrangement during development, which is characteristic of immunoglobulin loci (27), is not a feature of the putative odor receptor gene family. No evidence for DNA rearrangement to generate the diversity of genes cloned has been observed. The entire coding region has been sequenced along with parts of the 5′ and 3′ untranslated regions of 10 different cDNA clones. The sequences of the coding regions are all different; no evidence has been obtained for constant regions that would suggest DNA rearrangement of the sort seen in the immune system. The observations indicate that the diversity olfactory proteins are coded by a large number of distinct gene sequences.

[0213] Although it is unlikely from the data that DNA rearrangement is responsible for the generation of diversity among the putative odorant receptors, it remains possible that DNA rearrangements may be involved in the regulation of expression of this gene family. If each olfactory neuron expresses only one or a small number of genes, then a transcriptional control mechanism must be operative to choose which of the more than one hundred genes within the family will be expressed in a given neuron. Gene conversion from one of multiple silent loci into a single active locus, as observed for the trypanosome-variable surface glycoproteins (32), provides one attractive model. The gene conversion event could be stochastic, such that a given neuron could randomly express any one of several hundred receptor genes, or regulated (perhaps by positional information) such that a given neuron could only express one or a small number of predetermined receptor types. Alternatively, it is possible that positional information in the olfactory epithelium controls the expression of the family of olfactory receptors by more classical mechanisms that do not involve DNA rearrangement. What ever mechanisms will regulate the expression of receptor genes within this large, multigene family, these mechanisms must accommodate the requirement that olfactory neurons are regenerated every 30-60 days (8) and therefore the expression of the entire repertoire of receptors must be accomplished many times during the life of an organism.

[0214] Receptor Diversity and the Central Processing of Olfactory Information

[0215] The results suggest the existence of a large family of distinct odorant receptors. Individual members of this receptor family are likely to be expressed by only a small set of the total number of olfactory neurons. The primary sensory neurons within the olfactory epithelium will therefore exhibit significant diversity at the level of receptor expression. The question then emerges as to whether neurons expressing the same receptors are localized in the olfactory epithelium. Does the olfactory system employ a topographic map to discriminate among the numerous odorant? The spatial organization of distinct classes of olfactory sensory neurons, as defined by receptor expression, can now be determined by using the procedures of in situ hybridization and immunohistochemistry with probes specific for the individual receptor subtypes. This information should help to distinguish between different models that have been proposed to explain the coding of diverse odorant stimuli (33).

[0216] In one model, sensory neurons that express a given receptor and respond to a given odorant may be localized within defined positions within the olfactory epithelium. This topographic arrangement would also be reflected in the projection of olfactory sensory axons into discrete regions (glomeruli) within the olfactory bulb. In this scheme, the central coding to permit the discrimination of discrete odorant would depend, in part, on the spatial segregation of different receptor populations. Attempts to discern the topographic localization of specific receptors at the level of the olfactory epithelium has led to conflicting results. In some studies, electrophysiological recordings have revealed differences in olfactory responses to distinct odorant in different regions of the olfactory epithelium (34, 35). However, these experiments have been difficult to interpret since the differences in response across the epithelium are often small and are not observed in all studies (36).

[0217] A second model argues that sensory neurons expressing distinct odorant receptors are randomly distributed in the epithelium but that neurons responsive to a given odorant project to restricted regions within the olfactory bulb. In this instance, the discrimination of odors would be a consequence of the position of second order neurons in the olfactory bulb, but would be independent of the site of origin of the afferent signals within the epithelium. Mapping of the topographic projections of olfactory neurons has been performed by extracellular recordings from different regions of the bulb (37, 38) and by 2-deoxyglucose autoradiography to map regional activity after exposure to different odorant (39). These studies suggest that spatially-localized groups of bulbar neurons preferentially respond to different odorant. The existence of specific odorant receptors, randomly distributed through the olfactory epithelium, which converge on a common target within the olfactory bulb, would raise additional questions about the recognition mechanisms used to guide these distinct axonal subsets to their central targets.

[0218] Other sensory systems also spatially segregate afferent input from primary sensory neurons. The spatial segregation of information employed, for example, by the visual and somatosensory systems, is used to define the location of the stimulus within the external environment as well as to indicate the quality of the stimulus. In contrast, olfactory processing does not extract spatial features of the odorant stimulus. Relieved of the necessity to encode information about the spatial localization of the sensory stimulus, it is possible that the olfactory system of mammals uses the spatial segregation of sensory input solely to encode the identity of the stimulus itself. The molecular identification of the genes likely to encode a large family of olfactory receptors should provide initial insights into the underlying logic of olfactory processing in the mammalian nervous system.

REFERENCES

[0219] 1. Rushton, W. A. H. (1965) A Foveal pigment in the deuteranope. J. Physiol. 176, 24-37.

[0220] 2. Rushton, W. A. H. (1955) Foveal photopigments in normal and colour-blind. J. Physiol. 129, 41-42.

[0221] 3. Wald, G., Brown, P. K., and Smith, P. H. (1955) Iodopsin. J. Gen. Physiol. 38, 623-681.

[0222] 4. Nathans, J., Thomas, D., and Hogness, D. S. (1986) Molecular genetics of human color vision: The genes encoding blue, green and red pigments. Science 232, 193-202.

[0223] 5. Lancet, D. (1986) Vertebrate olfactory reception. Ann. Rev. Neurosci. 9, 329-355.

[0224] 6. Reed, R.R. (1990) How does the nose know? Minireview. Cell 60, 1-2.

[0225] 7. Moulton, D. G., and Beidler, L. M. (1967) Structure and function in the peripheral olfactory system. Physiol. Rev. 47, 1-52.

[0226] 8. Graziadei, P. P. C., and Monti Graziadei, G. A. (1979) Neurogenesis and neuron regeneration in the olfactory system of mammals. I. Morphological aspects of differentiation and structural organization of the olfactory sensory neurons. J. Neurocytol. 8, 1-18.

[0227] 9. Bronshtein, A. A., and Minor, A. V. (1977) Regeneration of olfactory flagella and restoration of the electroolfactogram following application of triton X-100 to the olfactory mucosa of frogs. Tsitologiia 19, 33-39.

[0228] 10. Rhein, L. D., and Cagan, R. H. (1980) Biochemical studies of olfaction: Isolation, characterization, and odorant binding activity of cilia from rainbow trout olfactory rosettes. Proc. Natl. Acad. Sci. USA 77, 4412-4416.

[0229] 11. Rhein, L. D., and Cagan, R. H. (1983) Biochemical studies of olfaction: Binding specificity of odorant to a cilia preparation from rainbow trout olfactory rosettes. J. Neurochem. 41, 569-577.

[0230] 12. Pace, U., Hanski, E., Salomon, Y., and Lancet, D. (1985) Odorant-sensitive adenylate cyclase may mediate olfactory reception. Nature 316, 255-258.

[0231] 13. Sklar, P. B., Anholt, R. R. H.; and Snyder, S. H. (1986) The odorant-sensitive adenylate cyclase of olfactory receptor cells: Differential stimulation by distinct classes of odorant. J. Biol. Chem. 261, 15538-15543.

[0232] 14. Breer, H., Boekhoff, I., and Tareilus, E. (1990) Rapid kinetics of second messenger formation in olfactory transduction. Nature 345, 65-68.

[0233] 15. Boekhoff, I., Tareilus, E., Strotmann, J., and Breer, H. (1990). Rapid activation of alternative second messenger pathways in olfactory cilia from rats by different odorant. EMBO J. 9, 2453

[0234] 16. Jones, D. T., and Reed, R. R. (1989) G_(olf): an olfactory neuron specific-G-protein involved in odorant signal transduction. Science 244, 790-795.

[0235] 17. Nakamura, T. and Gold G. (1987) A cyclic nucleotide-gated conductance in olfactory receptor cilia. Nature 325, 442-444.

[0236] 18. Dhallan, R. S., Yau, K. -W., Schrader, K. A., and Reed, R. R. (1990) Primary structure and functional expression of a cyclic nucleotide-activated channel from olfactory neurons. Nature 347, 184-187.

[0237] 19. O'Dowd, B. F., Felkowitz, R. J., and Caron, M. G. (1989) Structure of the adrenergic and related receptors. Ann. Rev. Neurosci. 12, 67-83.

[0238] 20. Strader, C. D., Sigal, I. S., and Dixon, R. F. (1989) Structural basis of adrenergic receptor function FASEB J. 3, 1825-1832.

[0239] 21. O'Down, B. F., Hnatowich, M., Caron, M. G., Lefkowitz, R. J., and Bouvier, M. (1989) Palmitoylation of the human β₂-adrenergic receptor. Mutation of CYS³⁴¹ in the carboxyl tail leads to an uncoupled nonpalymitoylated form of the receptor. J. Biol. Chem. 264, 7564-7569.

[0240] 22. Kobilka, B. KI., Kobilka, T. S., Daniel, K., Regan, J. W., Caron, M. G., and Lefkowitz, R. J. (1988) Chimeric α₂-β₂-ldrenergic receptors: delineation of domains involved in effector coupling and ligand binding specificity. Science 240, 1310-1316.

[0241] 23. Hamm, H. E., Deretic, D., Arendt, A., Hargrove, P. A., Koenig, B., and Hofmann, K. P. (1988) Site of G protein binding to rhodopsin mapping with synthetic peptides to the a subunit. Science 241, 832-835. (1988).

[0242] 24. Bouvier, M. W., Hausdorff, A., De Blasi, A., O'Dowd, B. F., Kobilka, B. K., Caron, M. G., and Lefkowitz, R. J. (1988) Removal of phosphorylation sites from the β-adrenergic receptor delays the onset of agonist-promoted desensitization. Nature (London) 333, 370-373.

[0243] 25. Amoore, J. E. (1982) Odor Theory and Odor Classification, In Fragrance Chemistry (New York: Academic Press, Inc.), pp. 27-73.

[0244] 26. Maeda, N., and Smithies, 0. (1986) The evolution of multigene families: Human haptoglobin genes. Ann. Rev. Genet. 20, 81-108.

[0245] 27. Tonegawa, S. (1983) Somatic generation of antibody diversity. Nature 302, 575-581.

[0246] 28. Hood, L., Kronenberg, M., and Hunkapiller, T. (1985) T cell antigen receptors and the immunoglobulin supergene family. Cell 40, 225-229.

[0247] 29. Baltimore, D. (1981) Gene conversion: Some implications for immunoglobulin genes. Minireview. Cell 24, 592-594.

[0248] 30. Egel, R. (1981) Intergenic conversion and reiterated genes. Nature 290, 191-192.

[0249] 31. Flanagan, J. G., Lefranc, M. -P., and Rabbitts, T. H. (1984) Mechanisms of divergence and convergence of the human immunoglobulin α1 and α2 constant region gene sequences. Cell 36, 681-688.

[0250] 32. Van der Ploeg, L. H. T. (1991) Antigenic variation in African trypanosomes: genetic recombination and transcriptional control of VSG genes. In Gene Rearrangement. (New York: Oxford University Press), pp 51-98.

[0251] 33. Shepherd, G. M. (1985) Are there labeled lines in the olfactory pathway? In Taste, Olfaction and the Central Nervous System (New York: The Rockefeller University Press), pp. 307-321.

[0252] 34. Mackay-Sim, A., Shaman, P., and Moulton, D. G. (1982) Topographic coding of olfactory quality: odorant-specific patterns of epithelial responsivity in the Salamander. J. Neurophysiol. 48, 584-596.

[0253] 35. Thommesen, G., and Doving, K. B. (1977) Spatial distribution of the EOG in the rat. A variation with odour stimulation. Acta Physiol. Scand. 99, 270-280.

[0254] 36. Sicard, G. (1985) Olfactory discrimination of structurally related molecules: Receptor cell responses to camphoraceous odorant. Brain Res. 326, 203-212.

[0255] 37. Thommesen, G. (1978) The spatial distribution of odour induced potentials in the olfactory bulb or char and trout (Salmonidae). Acta Physiol. Scand. 102, 205-217.

[0256] 38. Doving, K. B., Selset, R., and Thommesen, G. (1980) Olfactory sensitivity to bile acids in salmonid fishes. Acta Physiol. Scand. 108, 123-131.

[0257] 39. Stewart, W. B., Kauer, J. S., and Shepherd, G. M. (1979) Functional organization of the rat olfactory bulb analyzed by the 2-deoxyglucose method. J. Comp. Neurol. 185, 715-734.

[0258] 40. Chirgwin, J. M., Przbyla, A. E., MacDonald, R. J., and Rutter, W. J. (1979) Isolation of biologically active ribonucleic acid from sources enriched in ribonuclease. Biochemistry 18, 5294-5299.

[0259] 41. Maniatis, T., Fritsch, E., and Sambrook, J. (1982) Molecular Cloning: A Laboratory Manual (Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press).

[0260] 42. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, Second Edition (Cold Spring Harbor, New York, Cold Spring Harbor Laboratory Press).

[0261] 43. Julius et al. (1988) Molecular Charatization of a Functional cDNA Encoding a Seritonin 1c Receptor. Science 241, 558-564.

[0262] 44. Araneda R. C., Kini A. D., and Firestein S. (2000) The molecular receptive range of an odorant receptor. Nature Neuroscience 3(12), 1248-1255.

[0263] 45. Krautwurst D., Yau K. -W., and Reed R. R. (1998) Identification of ligands for olfactory receptors by functional expression of a receptor library. Cell 95, 917-926.

[0264] 46. Malnic B., Hirono J., Sato T., and Buck, L. B. (1999) Combinatorial receptor codes for odors. Cell 96, 713-723.

[0265] 47. Zhao H., Ivic L., Otaki J. M., Hashimoto M., Mikoshiba K., and Firestein S. (1998) Functional expression of a mammalian odorant receptor. Science 279, 237-242.

1 80 1 954 DNA Rattus sp. F12 1 atggaatcag ggaacagcac aagaagattt tcaagttttt ttcttcttgg atttacagaa 60 aacccacaac ttcacttcct catttttgca ctattcctgt ccatgtacct ggtaacagtg 120 cttgggaacc tgcttatcat tatggccatc atcacacagt ctcatttgca tacacccatg 180 tactttttcc ttgctaacct atcctttgtg gacatctgtt tcacctccac caccatccca 240 aagatgttgg taaatatata cacccagagc aagagcatca cctatgaaga ctgtattagc 300 cagatgtgtg tcttcttggt tttcgcagaa ttgggcaact ttctcctggc tgtgatggcc 360 tatgaccgat atgtggctaa ctgtcaccca ctgtgttaca cagtcattgt gaaccaccgg 420 ctctgtatcc tgctgcttct gctgtcctgg gttatcagca ttttccatgc cttcatacag 480 agcttaattg tgctacagtt gaccttctgt ggagatgtga aaatccctca cttcttctgt 540 gaacttaatc agctgtccca actcacctgt tcagacaact ttccaagtca cctcataatg 600 aatcttgtac ctgttatgtt ggcagccatt tccttcagtg gcatccttta ctcttatttc 660 aagatagtat cctccataca ttctatctcc acagttcagg ggaagtacaa ggcattttct 720 acttgtgcct ctcacctttc cattgtctcc ttattttata gtacaggcct cggagtgtac 780 gtcagttctg ctgtggtcca aagctcacat tctgctgcaa gtgcttcggt catgtatact 840 gtggtcaccc ccatgctgaa ccccttcatt tatagtctaa ggaataaaga tgtgaagaga 900 gctctggaaa gactgttaga aggaaactgt aaagtgcatc attggactgg atga 954 2 1002 DNA Rattus sp. F3 2 atggactcaa gcaacaggac aagagtttca gaatttcttc ttcttggatt tgtagaaaac 60 aaagacctac aaccccttat ttatggtctt tttctctcta tgtacctggt tactgtcatt 120 ggaaacatat ccattattgt ggctatcatt tcagatccct gtctgcacac ccccatgtat 180 ttcttcctct ctaacctgtc ctttgtggac atctgtttca tttcaaccac tgttccaaag 240 atgttagtga acatccagac ccaaaacaat gtcatcacct atgcaggatg cattacccag 300 atatactttt tcttgctctt tgtagaattg gacaacttct tgctgactat catggcctat 360 gaccgttacg tagccatctg tcaccccatg cactacacag ttatcatgaa ctacaagctc 420 tgtggatttc tggttctggt atcttggatt gtaagtgttc tgcatgcctt gtttcaaagc 480 ttgatgatgt tggcgctgcc cttctgcaca catctggaaa tcccacacta cttctgtgaa 540 cctaatcagg tgattcaact cacctgttct gatgcatttc ttaatgatct tgtgatatat 600 tttacacttg tgctgctggc tactgttcct cttgctggca tcttctattc ttacttcaag 660 atagtgtcct ccatatgtgc tatatcgtca gttcatggga agtacaaagc attctccacc 720 tgtgcatctc acctttcagt cgtgtcttta ttttactgca caggactagg agtgtacctc 780 agttctgctg caaacaacag ctcacaggca agtgccacag cctcagtcat gtacactgta 840 gttaccccta tggtgaaccc ttttatctat agtcttagga ataaagatgt taagagtgtt 900 ctgaaaaaaa ctctttgtga ggaagttata aggagtccac cttccctact tcatttcttc 960 ctagtgttat gtcatctccc ttgttttatt ttttgttatt aa 1002 3 942 DNA Rattus sp. F5 3 atgagcagca ccaaccagtc cagtgtcacc gagttcctcc tcctgggact ctccaggcag 60 ccccagcagc agcagctcct cttcctgctc ttcctcatca tgtacctggc cactgtcctg 120 ggaaacctgc tcatcatcct ggctattggc acagactccc gcctgcacac ccccatgtac 180 ttcttcctca gtaacctgtc ctttgtggat gtctgcttct cctctaccac tgtccctaaa 240 gttctggcca accatatact tgggagtcag gccatttcct tctctgggtg tctcacccag 300 ctgtattttc tcgctgtgtt tggtaacatg gacaatttcc tgctggctgt gatgtcctat 360 gaccgatttg tggccatatg ccacccttta cactacacaa caaagatgac ccgtcagctc 420 tgtgtcctgc ttgttgtggg gtcatgggtt gtagccaaca tgaattgtct gttgcacata 480 ctgctcatgg ctcgactctc cttctgtgca gacaacatga tcccccactt cttctgtgat 540 ggaactcccc tcctgaaact ctcctgctca gacacacatc tcaatgagct gatgattctt 600 acagagggag ctgtggtcat ggtcacccca tttgtctgca tcctcatctc ctacatccac 660 atcacctgtg ctgtcctcag agtctcatcc cccaggggag gatggaaatc cttctccacc 720 tgtggctccc acctggctgt ggtctgcctc ttctatggca ccgtcatcgc tgtgtatttc 780 aacccatcat cctctcactt agctgggagg gacatggcag ctgcagtgat gtatgcagtg 840 gtgaccccaa tgctgaaccc tttcatctat agcctgagga acagcgacat gaaagcagct 900 ttaaggaaag tgctcgccat gagatttcca tctaagcagt aa 942 4 936 DNA Rattus sp. F6 4 atggcttgga gtactggcca gaacctgtcc acaccaggac cattcatctt gctgggcttc 60 ccagggccaa ggagcatgcg cattgggctc ttcctgcttt tcctggtcat gtatctgctt 120 acggtagttg gaaacctagc catcatctcc ctggtaggtg cccacagatg cctacagaca 180 cccatgtact tcttcctctg caacctctcc ttcctggaga tctggttcac cacagcctgc 240 gtacccaaga ccctggccac atttgcgcct cggggtggag tcatttcctt ggctggctgt 300 gccacacaga tgtactttgt cttttctttg ggctgtaccg agtacttcct gctggctgtg 360 atggcttatg accgctacct ggccatctgc ctgccactgc gctatggtgg catcatgact 420 cctgggctgg cgatgcggtt ggccctggga tcctggctgt gtgggttttc tgcaatcaca 480 gttcctgcta ccctcattgc ccgcctctct ttctgtggct cacgtgtcat caaccacttc 540 ttctgtgaca tttcgccctg gatagtgctt tcctgcaccg acacgcaggt ggtggaactg 600 gtgtcctttg gcattgcctt ctgtgttatt ctgggctcgt gtggtatcac actagtctcc 660 tatgcttaca tcatcactac catcatcaag attccctctg cccggggccg gcaccgcgcc 720 ttctcaacct gctcatccca tctcactgtg gtgctgattt ggtatggctc caccatcttc 780 ttgcatgtga ggacctcggt agagagctcc ttggacctca ccaaagctat cacagtgctc 840 aacaccattg tcacacctgt gctgaaccct ttcatatata ctctgaggaa caaggatgtc 900 aaggaagctc tgcgcaggac ggtgaagggg aagtga 936 5 939 DNA Rattus sp. I14 5 atgactggaa ataaccaaac tttgatcttg gagttcctcc tcctgggtct gcccatccca 60 tcagagtatc atctcctgtt ctatgccctg ttcctggcca tgtacctcac catcatcctg 120 ggaaacctgc taatcattgt ccttgttcga ctggactctc atctccacat gcccatgtac 180 ttgtttctca gcaacttgtc cttctctgac ctctgctttt cctctgtcac aatgcccaaa 240 ttgcttcaga acatgcagag ccaagtacca tctatatcct atacaggctg cctgacacag 300 ctgtacttct ttatggtttt tggagatatg gagagcttcc ttcttgtggt catggcctat 360 gaccgctatg tggccatttg ctttcctttg cgttacacca ccatcatgag caccaagttc 420 tgtgcttcac tagtgctact tctgtggatg ctgacgatga cccatgccct gctgcatacc 480 ctactcattg ctagattgtc tttttgtgag aagaatgtga ttcttcactt tttctgtgac 540 atttctgctc ttctgaagtt gtcctgctca gacatttatg ttaatgagct gatgatatat 600 atcttgggtg gactcatcat tattatccca ttcctattaa ttgttatgtc ctatgttaga 660 attttcttct ccattttgaa gtttccatct attcaggaca tctacaaggt attctcaacc 720 tgtggttccc atctgtctgt ggtgaccttg ttttatggga caatttttgg tatctactta 780 tgtccatcag gtaataattc tactgtgaag gagattgcca tggctatgat gtacacagtg 840 gtgactccca tgctgaatcc cttcatctac agcctgagga acagagacat gaaaagggcc 900 ctaataagag ttatctgcac taagaaaatc tctctgtaa 939 6 945 DNA Rattus sp. I15 6 atgacagaag agaaccaaac tgtgatctcc cagttccttc tccttttcct gcccatcccc 60 tcagagcacc agcacgtgtt ctacgccctg ttcctgtcca tgtacctcac cactgtcctg 120 gggaacctca tcatcatcat cctcattcac ctggactccc atctccacac acccatgtac 180 ttgtttctca gcaacttgtc cttctctgat ctctgctttt cctctgttac gatgcccaag 240 ttgttgcaga acatgcagag ccaagttcca tccatcccct ttgcaggctg cctgacacaa 300 ttatactttt acctgtattt tgcagacctt gagagcttcc tgcttgtggc catggcctat 360 gaccgctatg tggccatctg cttccccctt cattacatga gcatcatgag ccccaagctc 420 tgtgtgagtc tggtggtgct gtcctgggtg ctgaccacct tccatgccat gctgcacacc 480 ctgctcatgg ccagattgtc attctgtgcg gacaatatga tcccccactt tttctgtgat 540 atatctcctt tattgaaact gtcctgctct gacacgcatg ttaatgagtt ggtgatattt 600 gtcatgggag ggcttgttat tgtcattcca tttgtgctca tcattgtatc ttatgcacga 660 gttgtcgcct ccattcttaa agtcccttct gtccgaggca tccacaagat cttctccacc 720 tgcggctccc atctgtctgt ggtgtcactg ttctatggga caatcattgg tctctactta 780 tgtccgtcag ctaataactc tactgtgaag gagactgtca tggccatgat gtacacagtg 840 gtgaccccca tgctgaaccc cttcatctac agcctgagga acagagacat gaaagaggca 900 ctgataagag tcctttgtaa aaagaaaatt accttctgtc tatga 945 7 933 DNA Rattus sp. I3 7 atgaacaatc aaactttcat cacccaattc cttctcctgg gactgcccat ccctgaagaa 60 catcagcacc tgttctatgc cttgttcctg gtcatgtacc tcaccaccat cttgggaaac 120 ttgctaatca ttgtacttgt tcaactggac tcccagctcc acacacctat gtatttgttt 180 ctcagcaatt tgtctttctc tgatctatgt ttttcctctg tcacaatgcc caagctgctg 240 cagaacatga ggagccagga cacatccatt ccctatggag gctgcctggc acaaacatac 300 ttctttatgg tttttggaga tatggagagt ttccttcttg tggccatggc ctatgaccgc 360 tatgtggcca tctgcttccc tctgcattac accagcatca tgagccccaa gctctgtact 420 tgtctagtgc tgttattgtg gatgctgacg acatcccatg ccatgatgca cacactgctt 480 gcagcaagat tgtctttttg tgagaacaat gtggtcctca acttcttctg tgacctattt 540 gttctcctaa agctggcctg ctcagacact tatattaatg agttgatgat atttatcatg 600 agtacactcc tcattattat tccattcttc ctcattgtta tgtcctatgc aaggatcata 660 tcctctattc ttaaggttcc atctacccaa ggcatctgca aggtcttctc tacctgtggt 720 tcccatctgt ctgtagtatc actgttctat gggacaatta ttggtctcta cttatgtcca 780 gcaggtaata attccactgt aaaagagatg gtcatggcca tgatgtacac tgtggtgacc 840 cccatgctga atcccttcat ctacagccta aggaatagag atatgaagag ggccctaata 900 agagttatct gtagtatgaa aatcactctg taa 933 8 984 DNA Rattus sp. I7 8 atggagcgaa ggaaccacag tgggagagtg agtgaatttg tgttgctggg tttcccagct 60 cctgccccac tgcgagtact actatttttc ctttctcttc tggactatgt gttggtgttg 120 actgaaaaca tgctcatcat tatagcaatt aggaaccacc caaccctcca caaacccatg 180 tattttttct tggctaatat gtcatttctg gagatttggt atgtcactgt tacgattcct 240 aagatgctcg ctggcttcat tggttccaag gagaaccatg gacagctgat ctcctttgag 300 gcatgcatga cacaactcta ctttttcctg ggcttgggtt gcacagagtg tgtccttctt 360 gctgtgatgg cctatgaccg ctatgtggct atctgtcatc cactccacta ccccgtcatt 420 gtcagtagcc ggctatgtgt gcagatggca gctggatcct gggctggagg ttttggtatc 480 tccatggtta aagttttcct tatttctcgc ctgtcttact gtggccccaa caccatcaac 540 cactttttct gtgatgtgtc tccattgctc aacctgtcat gcactgacat gtccacagca 600 gagcttacag actttgtcct ggccattttt attctgctgg gaccgctctc tgtcactggg 660 gcatcctaca tggccatcac aggtgctgtg atgcgcatcc cctcagctgc tggccgccat 720 aaagcctttt caacctgtgc ctcccacctc actgttgtga tcatcttcta tgcagccagt 780 attttcatct atgccaggcc taaggcactc tcagcttttg acaccaacaa gctggtctct 840 gtactctacg ctgtcattgt accgttgttc aatcccatca tctactgctt gcgcaaccaa 900 gatgtcaaaa gagcgctacg tcgcacgctg cacctggccc aggaccagga ggccaatacc 960 aacaaaggca gcaaaattgg ttag 984 9 939 DNA Rattus sp. I8 9 atgaacaaca aaactgtcat cacccatttc ctcctcctgg gattgcccat ccccccagag 60 caccagcaac tgttctttgc cctgttcctg atcatgtacc tcaccacctt tctgggaaac 120 ctgctaattg ttgtccttgt tcaactggac tctcatctcc acacacccat gtacttgttt 180 ctcagcaact tgtccttctc tgatctctgc ttttcctctg ttacaatgct gaaattgctg 240 caaaatatac agagccaagt accatctata tcctatgcag gatgcctgac acagatattc 300 ttctttttgt tgtttggcta ccttgggaat ttccttcttg tagccatggc ctatgaccgc 360 tatgtggcca tctgcttccc tctgcattat accaacatca tgagccataa gctctgtact 420 tgtctcctgc tggtattttg gataatgaca tcatctcatg ccatgatgca caccctgctt 480 gcagcaagat tgtctttttg tgagaacaat gtactcctca actttttctg tgacctgttt 540 gttctcctaa agttggcctg ctcagacact tatgttaatg agttgatgat acatatcatg 600 ggcgtgatca tcattgttat tccattcgtg ctcattgtta tatcctatgc caagatcatc 660 tcctccattc ttaaggttcc atctactcaa agcattcaca aggtcttctc cacttgtggt 720 tctcatctct ctgtggtgtc tctgttctac gggacaatta ttggtctcta tttatgtcca 780 tcaggtgata attttagtct aaaggggtct gccatggcta tgatgtacac agtggtaact 840 ccaatgctga acccgttcat ctacagccta agaaacagag acatgaagca ggccctaata 900 agagttacct gtagcaagaa aatctctctg ccatggtag 939 10 945 DNA Rattus sp. I9 10 atgactagaa gaaaccaaac tgccatctct cagttcttcc ttctgggcct gccattcccc 60 ccagagtacc aacacctgtt ctatgccctg ttcctggcca tgtacctcac cactctcctg 120 gggaacctca tcatcatcat cctcattcta ctggactccc atctccacac acccatgtac 180 ttgtttctca gcaatttatc ctttgccgac ctctgttttt cctctgtcac aatgcccaag 240 ttgttgcaga acatgcagag ccaagttcca tccatcccct atgcagggtg cctggcacag 300 atatacttct ttctgttttt tggagacctt ggaaacttcc tgcttgtggc catggcctat 360 gaccgctatg tggccatctg cttccccctt cattacatga gcatcatgag ccccaagctc 420 tgtgtgagtc tggtggtgct gtcctgggtg ctgactacct tccatgccat gctgcacacc 480 ctgctcatgg ccagattgtc attctgtgag gacagtgtga tccctcacta tttctgtgat 540 atgtctactc tgctgaaagt ggcttgttct gacacccatg ataatgaatt agcaatattt 600 atcttagggg gccctatagt tgtactacct ttccttctca tcattgtttc ttatgcaaga 660 attgtttcct ccatcttcaa ggtcccttct tctcaaagca tccataaagc cttctccacc 720 tgtggctccc acctgtctgt ggtgtcactg ttctatggga cagtcattgg tctctactta 780 tgtccttcag ctaataactc cactgtgaag gagactgtca tgtctttgat gtacacaatg 840 gtgacaccca tgctgaaccc cttcatctac agcctaagaa acagagacat aaaagatgca 900 ttagaaaaaa taatgtgcaa aaagcaaatt ccctcctttc tatga 945 11 645 DNA Homo Sapiens H5 misc_feature ()..() n = unknown 11 atctgttttg tgtctaccac tgtcccaaag cagctggtga acatccagac acagagcaga 60 gtcatcacct atgcagactg catcacccag atgtgctttt ttatactctt tgtagtgttg 120 gacagcttac tcctgactgt gatggcctat gaccggtttg tggccatctg tcaccccctg 180 cactacacag tcattatgag ctcctggctc tgtggactgc tggttctggt gtcctggatc 240 gtgagcatcc tatattctct gttacaaagc ataatggcat tgcagctgtc cttctgtaca 300 gaactgaaaa tccctcaatt tttctgtgaa cttaatcagg tcatccacct tgcctgttcc 360 gacactttta ttaatgacat gatgatgaat tttacaagtg tgctgctggg tgggggatgc 420 ctcgctggaa tattttactn ntactttaag atactttgtt gcatatgttc gatctcatca 480 gctcagggga tgaataaagc actttccacc tgtgcatctc acctctcagt tgtctcctta 540 ttttattgta caggcgtagg tgtgtacctt agttctgctg caacccataa ctcactctca 600 aatgctgcag cctcggtgat gtacactgtg gtcacctcca tgctg 645 12 215 PRT Homo Sapiens H5 UNSURE (147)..(147) x = unknown 12 Ile Cys Phe Val Ser Thr Thr Val Pro Lys Gln Leu Val Asn Ile Gln 1 5 10 15 Thr Gln Ser Arg Val Ile Thr Tyr Ala Asp Cys Ile Thr Gln Met Cys 20 25 30 Phe Phe Ile Leu Phe Val Val Leu Asp Ser Leu Leu Leu Thr Val Met 35 40 45 Ala Tyr Asp Arg Phe Val Ala Ile Cys His Pro Leu His Tyr Thr Val 50 55 60 Ile Met Ser Ser Trp Leu Cys Gly Leu Leu Val Leu Val Ser Trp Ile 65 70 75 80 Val Ser Ile Leu Tyr Ser Leu Leu Gln Ser Ile Met Ala Leu Gln Leu 85 90 95 Ser Phe Cys Thr Glu Leu Lys Ile Pro Gln Phe Phe Cys Glu Leu Asn 100 105 110 Gln Val Ile His Leu Ala Cys Ser Asp Thr Phe Ile Asn Asp Met Met 115 120 125 Met Asn Phe Thr Ser Val Leu Leu Gly Gly Gly Cys Leu Ala Gly Ile 130 135 140 Phe Tyr Xaa Tyr Phe Lys Ile Leu Cys Cys Ile Cys Ser Ile Ser Ser 145 150 155 160 Ala Gln Gly Met Asn Lys Ala Leu Ser Thr Cys Ala Ser His Leu Ser 165 170 175 Val Val Ser Leu Phe Tyr Cys Thr Gly Val Gly Val Tyr Leu Ser Ser 180 185 190 Ala Ala Thr His Asn Ser Leu Ser Asn Ala Ala Ala Ser Val Met Tyr 195 200 205 Thr Val Val Thr Ser Met Leu 210 215 13 640 DNA Rattus sp. J1 misc_feature ()..() n = unknown 13 catctgcttt acttctgcta gcatcccaaa gatgctagtg aatatacaga cgaagaacaa 60 ggtgatcacc tatgaaggct gcatctccca agtatacttt tcatactctt tggagttttg 120 gacaactttc ttctcgactg tgatggccta tgaccgatat gtggccatct gtcacccatc 180 tnactacaca ggtcatcatg aaccnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 420 nnnnnnnntt tattcttact ctaagatagt ttcctccata cgagaaatct catcatcaca 480 gggaaagtac aagnnattct ccacctgtgc atcccacctc tcagttgttt cattattcta 540 ttctacactt ttgggtgtgt accttagttc ttcttttacc caaaactcac actcaactgc 600 acgggcatct gttatgtaca gtgtggtcac ccccatgttg 640 14 213 PRT Rattus sp. J1 UNSURE (61)..(165) x = unknown 14 Ile Cys Phe Thr Ser Ala Ser Ile Pro Lys Met Leu Val Asn Ile Gln 1 5 10 15 Thr Lys Asn Lys Val Ile Thr Tyr Glu Gly Cys Ile Ser Gln Val Tyr 20 25 30 Phe Ser Tyr Ser Leu Glu Phe Trp Thr Thr Phe Phe Ser Thr Val Met 35 40 45 Ala Tyr Asp Arg Tyr Val Ala Ile Cys His Pro Ser Xaa Tyr Thr Gly 50 55 60 His His Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 65 70 75 80 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 85 90 95 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 100 105 110 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 115 120 125 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr 130 135 140 Ser Tyr Ser Lys Ile Val Ser Ser Ile Arg Glu Ile Ser Ser Ser Gln 145 150 155 160 Gly Lys Tyr Lys Xaa Phe Ser Thr Cys Ala Ser His Leu Ser Val Val 165 170 175 Ser Leu Phe Tyr Ser Thr Leu Leu Gly Val Tyr Leu Ser Ser Ser Phe 180 185 190 Thr Gln Asn Ser His Ser Thr Ala Arg Ala Ser Val Met Tyr Ser Val 195 200 205 Val Thr Pro Met Leu 210 15 636 DNA Rattus sp. J2 15 acctccacca ccatcccaaa gatgctggta aatatacaca cccagagcaa tactatcacc 60 tatgaagact gtatttccca gatgtttgta ctcttggttt ttggagaact ggacaacttt 120 ctcctggctg tgatggccta tgatcgatat gtggctatct gtcacccact gtattacaca 180 gtcattgtga accaccgact ctgtatcctg ctgcttctgc tgtcctgggt tgtcagcatt 240 ttacatgcct tcttacagag cttaattgta ctacagttga ccttctgtgg agatgtgaaa 300 atccctcact tcttctgtga gctcaatcag ctgtcccaac tcacatgttc agacaacttt 360 ccaagtcacc tcacaatgca tcttgtacct gttatatttg cagctatttc cctcagtggt 420 atcctttact cttatttcaa gatagtgtct tccatacgtt ctatgtcctc agttcaaggg 480 aagtacaagg cattttctac atgtgcctct cacctttcca ttgtctcctt attttatagt 540 acaggcctcg gggtgtacgt cagttctgct gtgatccgaa gctcacactc ctctgcaagt 600 gcttcggtca tgtatactgt ggtcaccccc atgttg 636 16 212 PRT Rattus sp. J2 16 Thr Ser Thr Thr Ile Pro Lys Met Leu Val Asn Ile His Thr Gln Ser 1 5 10 15 Asn Thr Ile Thr Tyr Glu Asp Cys Ile Ser Gln Met Phe Val Leu Leu 20 25 30 Val Phe Gly Glu Leu Asp Asn Phe Leu Leu Ala Val Met Ala Tyr Asp 35 40 45 Arg Tyr Val Ala Ile Cys His Pro Leu Tyr Tyr Thr Val Ile Val Asn 50 55 60 His Arg Leu Cys Ile Leu Leu Leu Leu Leu Ser Trp Val Val Ser Ile 65 70 75 80 Leu His Ala Phe Leu Gln Ser Leu Ile Val Leu Gln Leu Thr Phe Cys 85 90 95 Gly Asp Val Lys Ile Pro His Phe Phe Cys Glu Leu Asn Gln Leu Ser 100 105 110 Gln Leu Thr Cys Ser Asp Asn Phe Pro Ser His Leu Thr Met His Leu 115 120 125 Val Pro Val Ile Phe Ala Ala Ile Ser Leu Ser Gly Ile Leu Tyr Ser 130 135 140 Tyr Phe Lys Ile Val Ser Ser Ile Arg Ser Met Ser Ser Val Gln Gly 145 150 155 160 Lys Tyr Lys Ala Phe Ser Thr Cys Ala Ser His Leu Ser Ile Val Ser 165 170 175 Leu Phe Tyr Ser Thr Gly Leu Gly Val Tyr Val Ser Ser Ala Val Ile 180 185 190 Arg Ser Ser His Ser Ser Ala Ser Ala Ser Val Met Tyr Thr Val Val 195 200 205 Thr Pro Met Leu 210 17 646 DNA Rattus sp. J4 17 cataggctat tcatcttctg tcacacccaa tatgcttgtc aacttcctta taaagcaaaa 60 taccatctca taccttggat gttctataca gtttggctca gctgctttgt ttggaggtct 120 tgaatgcttc cttctggctg ccatggcgta tgatcgtttt gtagcaatct gcaacccact 180 gctttattca acgaaaatgt ccacacaagt ctgtgtccag ttggttgtgg gatcttatat 240 agggggattt cttaatgcct cctcttttac cctttccttt ttttccttgt ccttctgtgg 300 accaaataga atcaatcact tttactgtga ttttgctccg ttagtagaac tttcttgctc 360 tgatgtcagt gttcctgatg ctgttacctc attttctgct gcctcagtta ctatgctcac 420 agtgtttatc atagccatct cctataccta tatcctcatc accatcctga agatgcgttc 480 cactgagggt cgacagaaag cattctctac ctgcacttcc cacctcactg cagtcactct 540 gtgctatgga accatcacat tcatctatgt gatgcccaag tccagctact ccacagacca 600 gaacaaggtg gtgtctgtgt tttatatggt ggtgatcccc atgttg 646 18 215 PRT Rattus sp. J4 18 Ile Gly Tyr Ser Ser Ser Val Thr Pro Asn Met Leu Val Asn Phe Leu 1 5 10 15 Ile Lys Gln Asn Thr Ile Ser Tyr Leu Gly Cys Ser Ile Gln Phe Gly 20 25 30 Ser Ala Ala Leu Phe Gly Gly Leu Glu Cys Phe Leu Leu Ala Ala Met 35 40 45 Ala Tyr Asp Arg Phe Val Ala Ile Cys Asn Pro Leu Leu Tyr Ser Thr 50 55 60 Lys Met Ser Thr Gln Val Cys Val Gln Leu Val Val Gly Ser Tyr Ile 65 70 75 80 Gly Gly Phe Leu Asn Ala Ser Ser Phe Thr Leu Ser Phe Phe Ser Leu 85 90 95 Ser Phe Cys Gly Pro Asn Arg Ile Asn His Phe Tyr Cys Asp Phe Ala 100 105 110 Pro Leu Val Glu Leu Ser Cys Ser Asp Val Ser Val Pro Asp Ala Val 115 120 125 Thr Ser Phe Ser Ala Ala Ser Val Thr Met Leu Thr Val Phe Ile Ile 130 135 140 Ala Ile Ser Tyr Thr Tyr Ile Leu Ile Thr Ile Leu Lys Met Arg Ser 145 150 155 160 Thr Glu Gly Arg Gln Lys Ala Phe Ser Thr Cys Thr Ser His Leu Thr 165 170 175 Ala Val Thr Leu Cys Tyr Gly Thr Ile Thr Phe Ile Tyr Val Met Pro 180 185 190 Lys Ser Ser Tyr Ser Thr Asp Gln Asn Lys Val Val Ser Val Phe Tyr 195 200 205 Met Val Val Ile Pro Met Leu 210 215 19 481 DNA Rattus sp. J7 19 catctgcaag cccctgcact acaccaccat catgaataac cgagtgtgca cagttctagt 60 cctctcctgt tggtttgctg gcctgttgat catcctccca cctcttggtc atggcctcca 120 gctggagttc tgtgactcca atgtgattga tcattttggc tgtgatgcct ctccaattct 180 gcagataacc tgctcagaca cggtatttat agagaaaatt gtcttggctt ttgccatact 240 gacactcatc attactctgg tatgtgttgt tctctcctac acatacatca tcaagaccat 300 tttaaagttt ccttctgctc aacaaagaaa aaaggccttt tctacatgtt cttcccacat 360 gattgtggtt tccatcacct atgggagctg tattttcatc tacatcaaac cttcagcgaa 420 ggaaggggta gccatcaata aggttgtatc tgtgctcaca acatcagtcg cccctttgct 480 c 481 20 160 PRT Rattus sp. J7 20 Ile Cys Lys Pro Leu His Tyr Thr Thr Ile Met Asn Asn Arg Val Cys 1 5 10 15 Thr Val Leu Val Leu Ser Cys Trp Phe Ala Gly Leu Leu Ile Ile Leu 20 25 30 Pro Pro Leu Gly His Gly Leu Gln Leu Glu Phe Cys Asp Ser Asn Val 35 40 45 Ile Asp His Phe Gly Cys Asp Ala Ser Pro Ile Leu Gln Ile Thr Cys 50 55 60 Ser Asp Thr Val Phe Ile Glu Lys Ile Val Leu Ala Phe Ala Ile Leu 65 70 75 80 Thr Leu Ile Ile Thr Leu Val Cys Val Val Leu Ser Tyr Thr Tyr Ile 85 90 95 Ile Lys Thr Ile Leu Lys Phe Pro Ser Ala Gln Gln Arg Lys Lys Ala 100 105 110 Phe Ser Thr Cys Ser Ser His Met Ile Val Val Ser Ile Thr Tyr Gly 115 120 125 Ser Cys Ile Phe Ile Tyr Ile Lys Pro Ser Ala Lys Glu Gly Val Ala 130 135 140 Ile Asn Lys Val Val Ser Val Leu Thr Thr Ser Val Ala Pro Leu Leu 145 150 155 160 21 481 DNA Rattus sp. J8 misc_feature ()..() n = unknown 21 catctgccac ccgctccact actctcttct catgagtcct gacaactgtg ctgctctggt 60 aacagtctcc tgggtgacag gggtgggcac gggcttcctg ccttccctcc tgatttctaa 120 gttggacttc tgtgggccca accgcatcaa ccatttcttc tgtgacctcc ctccattaat 180 ccagctgtcc tgctccagcg tctttgtgac agaaatggcc atctttgtcc tgtccatcgc 240 tgtgctctgc atctgtttcc tcctaacccn nnnntcctac attttcatag tgtcctccat 300 tctgagaatc ccttccacta ccggcaggat gaagacattt tctacatgtg gctcccacct 360 ggccgtggtc accatctact atgggaccat gatctccatg tatgtcggcc caaatgcgca 420 tctgtccccg gagctcaaca aggtcatttc tgtcttctac actgtgatca ccccactact 480 g 481 22 160 PRT Rattus sp. J8 UNSURE (90)..(91) x = unknown 22 Ile Cys His Pro Leu His Tyr Ser Leu Leu Met Ser Pro Asp Asn Cys 1 5 10 15 Ala Ala Leu Val Thr Val Ser Trp Val Thr Gly Val Gly Thr Gly Phe 20 25 30 Leu Pro Ser Leu Leu Ile Ser Lys Leu Asp Phe Cys Gly Pro Asn Arg 35 40 45 Ile Asn His Phe Phe Cys Asp Leu Pro Pro Leu Ile Gln Leu Ser Cys 50 55 60 Ser Ser Val Phe Val Thr Glu Met Ala Ile Phe Val Leu Ser Ile Ala 65 70 75 80 Val Leu Cys Ile Cys Phe Leu Leu Thr Xaa Xaa Ser Tyr Ile Phe Ile 85 90 95 Val Ser Ser Ile Leu Arg Ile Pro Ser Thr Thr Gly Arg Met Lys Thr 100 105 110 Phe Ser Thr Cys Gly Ser His Leu Ala Val Val Thr Ile Tyr Tyr Gly 115 120 125 Thr Met Ile Ser Met Tyr Val Gly Pro Asn Ala His Leu Ser Pro Glu 130 135 140 Leu Asn Lys Val Ile Ser Val Phe Tyr Thr Val Ile Thr Pro Leu Leu 145 150 155 160 23 646 DNA Rattus sp. J11 misc_feature ()..() n = unknown 23 ngtctgcttc tcctccacca ctgtccccaa ggtactggct aaccacatac tcagtagtca 60 ggccatttcc ttctctgggt gtctaactca gctgtatttt ctctgtgtgt ctgtgaatat 120 ggacaatttc ctgctggctg tgatggccta tgacagattt gtggccatat gccacccttt 180 gtactacaca acaaagatga cccaccagct ctgtgtcttg ctggtgtctg gatcannnnn 240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nntgtgatca tggtcacccc 420 atttgtctgc atcctcatct cttacatcta catcaccaat gcagtcctca gagtctcatc 480 ctttagggga ggatggaaag ccttctccac ctgtggctca cacctggctg tggtctgcct 540 cttctatggc accatcattg ctgtgtattt caatcctgta tcttcccatt catctgagaa 600 ggacactgca gcaactgtgc tatacacagt ggtgactccc atgttg 646 24 215 PRT Rattus sp. J11 UNSURE (79)..(134) x = unknown 24 Val Cys Phe Ser Ser Thr Thr Val Pro Lys Val Leu Ala Asn His Ile 1 5 10 15 Leu Ser Ser Gln Ala Ile Ser Phe Ser Gly Cys Leu Thr Gln Leu Tyr 20 25 30 Phe Leu Cys Val Ser Val Asn Met Asp Asn Phe Leu Leu Ala Val Met 35 40 45 Ala Tyr Asp Arg Phe Val Ala Ile Cys His Pro Leu Tyr Tyr Thr Thr 50 55 60 Lys Met Thr His Gln Leu Cys Val Leu Leu Val Ser Gly Ser Xaa Xaa 65 70 75 80 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 85 90 95 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 100 105 110 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 115 120 125 Xaa Xaa Xaa Xaa Xaa Xaa Val Ile Met Val Thr Pro Phe Val Cys Ile 130 135 140 Leu Ile Ser Tyr Ile Tyr Ile Thr Asn Ala Val Leu Arg Val Ser Ser 145 150 155 160 Phe Arg Gly Gly Trp Lys Ala Phe Ser Thr Cys Gly Ser His Leu Ala 165 170 175 Val Val Cys Leu Phe Tyr Gly Thr Ile Ile Ala Val Tyr Phe Asn Pro 180 185 190 Val Ser Ser His Ser Ser Glu Lys Asp Thr Ala Ala Thr Val Leu Tyr 195 200 205 Thr Val Val Thr Pro Met Leu 210 215 25 646 DNA Rattus sp. J14 misc_feature ()..() n = unknown 25 tgtctgcttc tcctccacca ctgtccccaa ggtactggct aaccacatac tcagtagtca 60 ggccatttcc ttctctgggt gtctaactca gctgtatttt ctctgtgtgt ctgtgaatat 120 ggacaatttc ctgctggctg tgatggccta tgacagattt gtggccatat gccacccttt 180 gtactacaca acaccgatga cccaccagct ctgtgtcttg ctggtgtctg gatcannnnn 240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nntgtgatca tggtcacccc 420 atttgtctgc atcctcatct cttacatcta catcaccaat gcagtcctca gagtctcatc 480 ctttagggga ggatggaaag ccttctccac ctgtggctca cacctggctg tggtctgcct 540 cttctatggc accatcattg ctgtgtattt caatcctgta tcttcccatt catctgagaa 600 ggacactgca gcaactgtgc tatacacagt ggtgactccc atgttg 646 26 215 PRT Rattus sp. J14 UNSURE (79)..(134) x = unknown 26 Val Cys Phe Ser Ser Thr Thr Val Pro Lys Val Leu Ala Asn His Ile 1 5 10 15 Leu Ser Ser Gln Ala Ile Ser Phe Ser Gly Cys Leu Thr Gln Leu Tyr 20 25 30 Phe Leu Cys Val Ser Val Asn Met Asp Asn Phe Leu Leu Ala Val Met 35 40 45 Ala Tyr Asp Arg Phe Val Ala Ile Cys His Pro Leu Tyr Tyr Thr Thr 50 55 60 Pro Met Thr His Gln Leu Cys Val Leu Leu Val Ser Gly Ser Xaa Xaa 65 70 75 80 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 85 90 95 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 100 105 110 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 115 120 125 Xaa Xaa Xaa Xaa Xaa Xaa Val Ile Met Val Thr Pro Phe Val Cys Ile 130 135 140 Leu Ile Ser Tyr Ile Tyr Ile Thr Asn Ala Val Leu Arg Val Ser Ser 145 150 155 160 Phe Arg Gly Gly Trp Lys Ala Phe Ser Thr Cys Gly Ser His Leu Ala 165 170 175 Val Val Cys Leu Phe Tyr Gly Thr Ile Ile Ala Val Tyr Phe Asn Pro 180 185 190 Val Ser Ser His Ser Ser Glu Lys Asp Thr Ala Ala Thr Val Leu Tyr 195 200 205 Thr Val Val Thr Pro Met Leu 210 215 27 481 DNA Rattus sp. J15 misc_feature ()..() x = unknown 27 tatctgcaac cctctgcgct acccagtgct catgagcggc cgggtgtgcc tgctcatggt 60 cgtggcctcc tggttgggag gatccctcaa cgcctccatt cagacttctc tgacccttca 120 gttcccctac tgtggatcac ggaagatctc ccacttcttc tgtgaggtgc cctcgctgct 180 gannntggcc tgtgcagaca ctgaagccta tgagcaggta ctatttgtga caggcgtggt 240 ggtcctcctg gtgcccatta cattcattac tgcctcttat gccctcatcc tggctgctgt 300 gctccgaatg cactctgcgg aggggagtca gaaggcccta gccacatgct cctctcacct 360 gacagtcgtc aatctcttct atgggcccct tgtctacacc tacatgttac ctgcttccta 420 tcactcacca ggccaagacg acatagtatc cgtcttttac accgttctca cacccatgct 480 t 481 28 160 PRT Rattus sp. J15 UNSURE (61)..(62) x = unknown 28 Ile Cys Asn Pro Leu Arg Tyr Pro Val Leu Met Ser Gly Arg Val Cys 1 5 10 15 Leu Leu Met Val Val Ala Ser Trp Leu Gly Gly Ser Leu Asn Ala Ser 20 25 30 Ile Gln Thr Ser Leu Thr Leu Gln Phe Pro Tyr Cys Gly Ser Arg Lys 35 40 45 Ile Ser His Phe Phe Cys Glu Val Pro Ser Leu Leu Xaa Xaa Ala Cys 50 55 60 Ala Asp Thr Glu Ala Tyr Glu Gln Val Leu Phe Val Thr Gly Val Val 65 70 75 80 Val Leu Leu Val Pro Ile Thr Phe Ile Thr Ala Ser Tyr Ala Leu Ile 85 90 95 Leu Ala Ala Val Leu Arg Met His Ser Ala Glu Gly Ser Gln Lys Ala 100 105 110 Leu Ala Thr Cys Ser Ser His Leu Thr Val Val Asn Leu Phe Tyr Gly 115 120 125 Pro Leu Val Tyr Thr Tyr Met Leu Pro Ala Ser Tyr His Ser Pro Gly 130 135 140 Gln Asp Asp Ile Val Ser Val Phe Tyr Thr Val Leu Thr Pro Met Leu 145 150 155 160 29 481 DNA Rattus sp. J16 29 catctgtagg cctcttcact atcctaccct catgacccag acactgtgtg ccaagattgc 60 cactggttgc tggttgggag gcttggctgg gccagtggta gaaatttcct tggtgtctcg 120 tctccttttt tgtggcccca atcacattca acacatcttt tgtgatttcc cacctgtgct 180 gagcttggct tgtactgata catcagtgaa tgtcctggta gattttatta taaacctctg 240 caagatcctg gccaccttcc tgctgatcct gagctcctac ttgcagataa tccgcacagt 300 gctcaagatt ccttcagctg caggcaagaa gaaagcattc tcgacttgtg cctcccatct 360 cactgtggtt ctcatcttct atgggagcat ccttttcatg tatgtgcggc tgaagaagac 420 ttactccctt gactacgaca gagccttggc agtagtctac tccgtggtta cccctttcct 480 g 481 30 160 PRT Rattus sp. J16 30 Ile Cys Arg Pro Leu His Tyr Pro Thr Leu Met Thr Gln Thr Leu Cys 1 5 10 15 Ala Lys Ile Ala Thr Gly Cys Trp Leu Gly Gly Leu Ala Gly Pro Val 20 25 30 Val Glu Ile Ser Leu Val Ser Arg Leu Leu Phe Cys Gly Pro Asn His 35 40 45 Ile Gln His Ile Phe Cys Asp Phe Pro Pro Val Leu Ser Leu Ala Cys 50 55 60 Thr Asp Thr Ser Val Asn Val Leu Val Asp Phe Ile Ile Asn Leu Cys 65 70 75 80 Lys Ile Leu Ala Thr Phe Leu Leu Ile Leu Ser Ser Tyr Leu Gln Ile 85 90 95 Ile Arg Thr Val Leu Lys Ile Pro Ser Ala Ala Gly Lys Lys Lys Ala 100 105 110 Phe Ser Thr Cys Ala Ser His Leu Thr Val Val Leu Ile Phe Tyr Gly 115 120 125 Ser Ile Leu Phe Met Tyr Val Arg Leu Lys Lys Thr Tyr Ser Leu Asp 130 135 140 Tyr Asp Arg Ala Leu Ala Val Val Tyr Ser Val Val Thr Pro Phe Leu 145 150 155 160 31 481 DNA Rattus sp. J17 misc_feature ()..() n = unknown 31 aatctgcaac ccactgcttt attccaccaa aatgtccaca caagtctgta tccagttggt 60 tgcaggatct tatatagggg gttttcttaa tacttgcctc atcatgtttt actttttctc 120 ttttctcttc tgtgggccaa atatagttga tcattttttc tgtgattttg ctcctttnnt 180 ggaactttcg tgctctgatg tgagtgtctc tgtagttgtt atgtcatttt ctgctggctc 240 agttactatg atcacagtgt ttatcatagc catctcctat tcttacatcc tcatcaccat 300 cctgaagatg tcctcaactg agggccgtca caaggctttc tccacatgta cctcccacct 360 cactgcagtc actctctact atggcaccat taccttcatt tatgtgatgc ccaagtccac 420 atactctaca gaccagaaca aggtggtgtc tgtgttttac atggtggtga tcccaatgtt 480 g 481 32 160 PRT Rattus sp. J17 UNSURE (59)..(60) x = unknown 32 Ile Cys Asn Pro Leu Leu Tyr Ser Thr Lys Met Ser Thr Gln Val Cys 1 5 10 15 Ile Gln Leu Val Ala Gly Ser Tyr Ile Gly Gly Phe Leu Asn Thr Cys 20 25 30 Leu Ile Met Phe Tyr Phe Phe Ser Phe Leu Phe Cys Gly Pro Asn Ile 35 40 45 Val Asp His Phe Phe Cys Asp Phe Ala Pro Xaa Xaa Glu Leu Ser Cys 50 55 60 Ser Asp Val Ser Val Ser Val Val Val Met Ser Phe Ser Ala Gly Ser 65 70 75 80 Val Thr Met Ile Thr Val Phe Ile Ile Ala Ile Ser Tyr Ser Tyr Ile 85 90 95 Leu Ile Thr Ile Leu Lys Met Ser Ser Thr Glu Gly Arg His Lys Ala 100 105 110 Phe Ser Thr Cys Thr Ser His Leu Thr Ala Val Thr Leu Tyr Tyr Gly 115 120 125 Thr Ile Thr Phe Ile Tyr Val Met Pro Lys Ser Thr Tyr Ser Thr Asp 130 135 140 Gln Asn Lys Val Val Ser Val Phe Tyr Met Val Val Ile Pro Met Leu 145 150 155 160 33 479 DNA Rattus sp. J19 33 tatctgccac cctctgaagt acacagttat catgaatcac tatttttgtg tgatgctgct 60 gctcttctct gtgttcgtta gcattgcaca tgcgttgttc cacattttaa tggtgttgat 120 actgactttc agcacaaaaa ctgaaatccc tcactttttc tgtgagctgg ctcatatcat 180 caaacttacc tgttccgata attttatcaa ctatctgctg atatacacag agtctgtctt 240 attttttggt gttcatattg tagggatcat tttgtcttat atttacactg tatcctcagt 300 tttaagaatg tcattattgg gaggaatgta taaagccttt tcaacatgtg gatctcattt 360 gtcggttgtc tctgttttat ggcacaggtt ttggggtaca cataagctct ccacttactg 420 actctccaag gaagactgta gtggcttcag tgatgtacac tgtggttact cagatgctg 479 34 139 PRT Rattus sp. J19 34 Ile Cys His Pro Leu Lys Tyr Thr Val Ile Met Asn His Tyr Phe Cys 1 5 10 15 Val Met Leu Leu Leu Phe Ser Val Phe Val Ser Ile Ala His Ala Leu 20 25 30 Phe His Ile Leu Met Val Leu Ile Leu Thr Phe Ser Thr Lys Thr Glu 35 40 45 Ile Pro His Phe Phe Cys Glu Leu Ala His Ile Ile Lys Leu Thr Cys 50 55 60 Ser Asp Asn Phe Ile Asn Tyr Leu Leu Ile Tyr Thr Glu Ser Val Leu 65 70 75 80 Phe Phe Gly Val His Ile Val Gly Ile Ile Leu Ser Tyr Ile Tyr Thr 85 90 95 Val Ser Ser Val Leu Arg Met Ser Leu Leu Gly Gly Met Tyr Lys Ala 100 105 110 Phe Ser Thr Cys Gly Ser His Leu Ser Val Val Ser Val Leu Trp His 115 120 125 Arg Phe Trp Gly Thr His Lys Leu Ser Thr Tyr 130 135 35 480 DNA Rattus sp. J20 misc_feature ()..() n = unknown 35 aatctgctac ccactgaggt accttctcat catgagctgg gtggtgtgca cagcactgtc 60 cgtggcaatc tgggtcatag gcttttgtgc ctccgttata cctctctgct tcacgatcct 120 cccactctgt ggtccttacg tcgttgatta tcttttctgc gagctgccca tccttctgca 180 cctgttctgc acagatacat ctctgctgga gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240 nnnnnnnnnn nncccttcct cctgattgtt ctctcctacc ttcgcatcct ggtggctgtg 300 ataagaatag actcagctga gggcagaaaa aaggcctttt caacttgtgc ttcacacttg 360 gctgtggtga ccatctacta tggaacaggg ctgatcaggt acttgaggcc caagtccctt 420 tattccgctg agggagacag actgatctct gtgttctatg cagtcattgg ccctgcactg 480 36 160 PRT Rattus sp. J2O UNSURE (71)..(84) x = unknown 36 Ile Cys Tyr Pro Leu Arg Tyr Leu Leu Ile Met Ser Trp Val Val Cys 1 5 10 15 Thr Ala Leu Ser Val Ala Ile Trp Val Ile Gly Phe Cys Ala Ser Val 20 25 30 Ile Pro Leu Cys Phe Thr Ile Leu Pro Leu Cys Gly Pro Tyr Val Val 35 40 45 Asp Tyr Leu Phe Cys Glu Leu Pro Ile Leu Leu His Leu Phe Cys Thr 50 55 60 Asp Thr Ser Leu Leu Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 65 70 75 80 Xaa Xaa Xaa Xaa Pro Phe Leu Leu Ile Val Leu Ser Tyr Leu Arg Ile 85 90 95 Leu Val Ala Val Ile Arg Ile Asp Ser Ala Glu Gly Arg Lys Lys Ala 100 105 110 Phe Ser Thr Cys Ala Ser His Leu Ala Val Val Thr Ile Tyr Tyr Gly 115 120 125 Thr Gly Leu Ile Arg Tyr Leu Arg Pro Lys Ser Leu Tyr Ser Ala Glu 130 135 140 Gly Asp Arg Leu Ile Ser Val Phe Tyr Ala Val Ile Gly Pro Ala Leu 145 150 155 160 37 35 DNA artificial - primer A1 modified_base (9)..(9) i 37 aantnnatnn tnntnaannt ngcngtngcn gcnga 35 38 32 DNA artificial - primer A2 misc_feature (3)..(3) n = c or t 38 aantanttnn tnntnaanct ngcnntngcn ga 32 39 32 DNA artificial - primer A3 misc_feature (3)..(4) n = c or t 39 aannnnttnn tnatnncnct ngcntnngcn ga 32 40 32 DNA artificial - primer A4 misc_feature (1)..(1) n = c or a 40 ngnttnntna tgtgnaanct nnnnttngcn ga 32 41 32 DNA artificial - primer A5 modified_base (3)..(3) i 41 acngtntana tnacncannt nnnnatngcn ga 32 42 33 DNA artificial - primer B1 modified_base (4)..(4) i 42 ctgnnnnttc atnannnnnt anannanngg ntt 33 43 31 DNA artificial - primer B2 misc_feature (1)..(1) n = g or t 43 nntnnttnag ncancantan atnatnggnt t 31 44 32 DNA artificial - primer B3 modified_base (3)..(3) i 44 tcnatnttna angtngtnta natnatnggn tt 32 45 32 DNA artificial - primer B4 misc_feature (3)..(3) n = c or t 45 gcnttngtna anatngcnta nagnaanggn tt 32 46 32 DNA artificial - primer B5 misc_feature (3)..(3) n = a or g 46 aantcnggnn nncgnnanta natnannggn tt 32 47 32 DNA artificial - primer B6 misc_feature (1)..(1) n = g or c 47 nnnnnnccna cnaanaanta natnaanggn tt 32 48 23 DNA artificial - primer P1 modified_base (6)..(6) i 48 atggcntang anngntangt ngc 23 49 29 DNA artificial - primer P4 modified_base (3)..(3) i 49 aanannnnna cnannnnnan ntgnnnnnc 29 50 6 PRT artificial - motif 50 Lys Ile Val Ser Ser Ile 1 5 51 6 PRT artificial - motif 51 Arg Ile Val Ser Ser Ile 1 5 52 6 PRT artificial - motif 52 His Ile Thr Cys Ala Val 1 5 53 6 PRT artificial - motif 53 His Ile Thr Trp Ala Val 1 5 54 19 PRT Rattus sp. 54 Leu Ser Lys Glu Asp Cys Ser Gly Phe Ser Asp Val His Cys Gly Tyr 1 5 10 15 Ser Asp Ala 55 9 PRT Artificial - motif UNSURE (2)..(7) x = unknown 55 Leu Xaa Xaa Pro Met Tyr Xaa Phe Leu 1 5 56 9 PRT Artificial - motif VARIANT (2)..(2) X = H or Q 56 Leu Xaa Xaa Pro Met Tyr Xaa Phe Leu 1 5 57 10 PRT Artificial - motif UNSURE (2)..(7) X = UNKNOWN 57 Met Xaa Tyr Asp Arg Xaa Xaa Ala Ile Cys 1 5 10 58 10 PRT Artificial - motif VARIANT (2)..(2) X = A OR S 58 Met Xaa Tyr Asp Arg Xaa Xaa Ala Ile Cys 1 5 10 59 7 PRT Artificial - motif UNSURE (3)..(4) X = Unknown 59 Asp Arg Xaa Xaa Ala Ile Cys 1 5 60 7 PRT Artificial - motif VARIANT (3)..(3) X = F or Y 60 Asp Arg Xaa Xaa Ala Ile Cys 1 5 61 9 PRT Artificial - motif UNSURE (2)..(7) X = Unknown 61 Xaa Xaa Phe Ser Thr Cys Xaa Ser His 1 5 62 9 PRT Artificial - motif VARIANT (1)..(1) X = K or R 62 Xaa Xaa Phe Ser Thr Cys Xaa Ser His 1 5 63 7 PRT Artificial - motif UNSURE (5)..(5) X = Unknown 63 Phe Ser Thr Cys Xaa Ser His 1 5 64 7 PRT Artificial - motif VARIANT (5)..(5) X = A or G or S 64 Phe Ser Thr Cys Xaa Ser His 1 5 65 12 PRT Artificial - motif UNSURE (2)..(9) X = Unknown 65 Pro Xaa Xaa Asn Pro Xaa Ile Tyr Xaa Leu Arg Asn 1 5 10 66 12 PRT Artificial - motif VARIANT (2)..(2) X = M or L or V 66 Pro Xaa Xaa Asn Pro Xaa Ile Tyr Xaa Leu Arg Asn 1 5 10 67 8 PRT Artificial - motif UNSURE (2)..(6) X = Unknown 67 Pro Xaa Xaa Asn Pro Xaa Ile Tyr 1 5 68 8 PRT Artificial - motif VARIANT (2)..(2) X = M or L or V 68 Pro Xaa Xaa Asn Pro Xaa Ile Tyr 1 5 69 9 PRT Artificial - motif UNSURE (3)..(6) X = Unknown 69 Asn Pro Xaa Ile Tyr Xaa Leu Arg Asn 1 5 70 9 PRT Artificial - motif VARIANT (3)..(3) X = F or I 70 Asn Pro Xaa Ile Tyr Xaa Leu Arg Asn 1 5 71 333 PRT Rattus sp. F3 71 Met Asp Ser Ser Asn Arg Thr Arg Val Ser Glu Phe Leu Leu Leu Gly 1 5 10 15 Phe Val Glu Asn Lys Asp Leu Gln Pro Leu Ile Tyr Gly Leu Phe Leu 20 25 30 Ser Met Tyr Leu Val Thr Val Ile Gly Asn Ile Ser Ile Ile Val Ala 35 40 45 Ile Ile Ser Asp Pro Cys Leu His Thr Pro Met Tyr Phe Phe Leu Ser 50 55 60 Asn Leu Ser Phe Val Asp Ile Cys Phe Ile Ser Thr Thr Val Pro Lys 65 70 75 80 Met Leu Val Asn Ile Gln Thr Gln Asn Asn Val Ile Thr Tyr Ala Gly 85 90 95 Cys Ile Thr Gln Ile Tyr Phe Phe Leu Leu Phe Val Glu Leu Asp Asn 100 105 110 Phe Leu Leu Thr Ile Met Ala Tyr Asp Arg Tyr Val Ala Ile Cys His 115 120 125 Pro Met His Tyr Thr Val Ile Met Asn Tyr Lys Leu Cys Gly Phe Leu 130 135 140 Val Leu Val Ser Trp Ile Val Ser Val Leu His Ala Leu Phe Gln Ser 145 150 155 160 Leu Met Met Leu Ala Leu Pro Phe Cys Thr His Leu Glu Ile Pro His 165 170 175 Tyr Phe Cys Glu Pro Asn Gln Val Ile Gln Leu Thr Cys Ser Asp Ala 180 185 190 Phe Leu Asn Asp Leu Val Ile Tyr Phe Thr Leu Val Leu Leu Ala Thr 195 200 205 Val Pro Leu Ala Gly Ile Phe Tyr Ser Tyr Phe Lys Ile Val Ser Ser 210 215 220 Ile Cys Ala Ile Ser Ser Val His Gly Lys Tyr Lys Ala Phe Ser Thr 225 230 235 240 Cys Ala Ser His Leu Ser Val Val Ser Leu Phe Tyr Cys Thr Gly Leu 245 250 255 Gly Val Tyr Leu Ser Ser Ala Ala Asn Asn Ser Ser Gln Ala Ser Ala 260 265 270 Thr Ala Ser Val Met Tyr Thr Val Val Thr Pro Met Val Asn Pro Phe 275 280 285 Ile Tyr Ser Leu Arg Asn Lys Asp Val Lys Ser Val Leu Lys Lys Thr 290 295 300 Leu Cys Glu Glu Val Ile Arg Ser Pro Pro Ser Leu Leu His Phe Phe 305 310 315 320 Leu Val Leu Cys His Leu Pro Cys Phe Ile Phe Cys Tyr 325 330 72 313 PRT Rattus sp. F5 72 Met Ser Ser Thr Asn Gln Ser Ser Val Thr Glu Phe Leu Leu Leu Gly 1 5 10 15 Leu Ser Arg Gln Pro Gln Gln Gln Gln Leu Leu Phe Leu Leu Phe Leu 20 25 30 Ile Met Tyr Leu Ala Thr Val Leu Gly Asn Leu Leu Ile Ile Leu Ala 35 40 45 Ile Gly Thr Asp Ser Arg Leu His Thr Pro Met Tyr Phe Phe Leu Ser 50 55 60 Asn Leu Ser Phe Val Asp Val Cys Phe Ser Ser Thr Thr Val Pro Lys 65 70 75 80 Val Leu Ala Asn His Ile Leu Gly Ser Gln Ala Ile Ser Phe Ser Gly 85 90 95 Cys Leu Thr Gln Leu Tyr Phe Leu Ala Val Phe Gly Asn Met Asp Asn 100 105 110 Phe Leu Leu Ala Val Met Ser Tyr Asp Arg Phe Val Ala Ile Cys His 115 120 125 Pro Leu His Tyr Thr Thr Lys Met Thr Arg Gln Leu Cys Val Leu Leu 130 135 140 Val Val Gly Ser Trp Val Val Ala Asn Met Asn Cys Leu Leu His Ile 145 150 155 160 Leu Leu Met Ala Arg Leu Ser Phe Cys Ala Asp Asn Met Ile Pro His 165 170 175 Phe Phe Cys Asp Gly Thr Pro Leu Leu Lys Leu Ser Cys Ser Asp Thr 180 185 190 His Leu Asn Glu Leu Met Ile Leu Thr Glu Gly Ala Val Val Met Val 195 200 205 Thr Pro Phe Val Cys Ile Leu Ile Ser Tyr Ile His Ile Thr Cys Ala 210 215 220 Val Leu Arg Val Ser Ser Pro Arg Gly Gly Trp Lys Ser Phe Ser Thr 225 230 235 240 Cys Gly Ser His Leu Ala Val Val Cys Leu Phe Tyr Gly Thr Val Ile 245 250 255 Ala Val Tyr Phe Asn Pro Ser Ser Ser His Leu Ala Gly Arg Asp Met 260 265 270 Ala Ala Ala Val Met Tyr Ala Val Val Thr Pro Met Leu Asn Pro Phe 275 280 285 Ile Tyr Ser Leu Arg Asn Ser Asp Met Lys Ala Ala Leu Arg Lys Val 290 295 300 Leu Ala Met Arg Phe Pro Ser Lys Gln 305 310 73 311 PRT Rattus sp. F6 73 Met Ala Trp Ser Thr Gly Gln Asn Leu Ser Thr Pro Gly Pro Phe Ile 1 5 10 15 Leu Leu Gly Phe Pro Gly Pro Arg Ser Met Arg Ile Gly Leu Phe Leu 20 25 30 Leu Phe Leu Val Met Tyr Leu Leu Thr Val Val Gly Asn Leu Ala Ile 35 40 45 Ile Ser Leu Val Gly Ala His Arg Cys Leu Gln Thr Pro Met Tyr Phe 50 55 60 Phe Leu Cys Asn Leu Ser Phe Leu Glu Ile Trp Phe Thr Thr Ala Cys 65 70 75 80 Val Pro Lys Thr Leu Ala Thr Phe Ala Pro Arg Gly Gly Val Ile Ser 85 90 95 Leu Ala Gly Cys Ala Thr Gln Met Tyr Phe Val Phe Ser Leu Gly Cys 100 105 110 Thr Glu Tyr Phe Leu Leu Ala Val Met Ala Tyr Asp Arg Tyr Leu Ala 115 120 125 Ile Cys Leu Pro Leu Arg Tyr Gly Gly Ile Met Thr Pro Gly Leu Ala 130 135 140 Met Arg Leu Ala Leu Gly Ser Trp Leu Cys Gly Phe Ser Ala Ile Thr 145 150 155 160 Val Pro Ala Thr Leu Ile Ala Arg Leu Ser Phe Cys Gly Ser Arg Val 165 170 175 Ile Asn His Phe Phe Cys Asp Ile Ser Pro Trp Ile Val Leu Ser Cys 180 185 190 Thr Asp Thr Gln Val Val Glu Leu Val Ser Phe Gly Ile Ala Phe Cys 195 200 205 Val Ile Leu Gly Ser Cys Gly Ile Thr Leu Val Ser Tyr Ala Tyr Ile 210 215 220 Ile Thr Thr Ile Ile Lys Ile Pro Ser Ala Arg Gly Arg His Arg Ala 225 230 235 240 Phe Ser Thr Cys Ser Ser His Leu Thr Val Val Leu Ile Trp Tyr Gly 245 250 255 Ser Thr Ile Phe Leu His Val Arg Thr Ser Val Glu Ser Ser Leu Asp 260 265 270 Leu Thr Lys Ala Ile Thr Val Leu Asn Thr Ile Val Thr Pro Val Leu 275 280 285 Asn Pro Phe Ile Tyr Thr Leu Arg Asn Lys Asp Val Lys Glu Ala Leu 290 295 300 Arg Arg Thr Val Lys Gly Lys 305 310 74 317 PRT Rattus sp. F12 74 Met Glu Ser Gly Asn Ser Thr Arg Arg Phe Ser Ser Phe Phe Leu Leu 1 5 10 15 Gly Phe Thr Glu Asn Pro Gln Leu His Phe Leu Ile Phe Ala Leu Phe 20 25 30 Leu Ser Met Tyr Leu Val Thr Val Leu Gly Asn Leu Leu Ile Ile Met 35 40 45 Ala Ile Ile Thr Gln Ser His Leu His Thr Pro Met Tyr Phe Phe Leu 50 55 60 Ala Asn Leu Ser Phe Val Asp Ile Cys Phe Thr Ser Thr Thr Ile Pro 65 70 75 80 Lys Met Leu Val Asn Ile Tyr Thr Gln Ser Lys Ser Ile Thr Tyr Glu 85 90 95 Asp Cys Ile Ser Gln Met Cys Val Phe Leu Val Phe Ala Glu Leu Gly 100 105 110 Asn Phe Leu Leu Ala Val Met Ala Tyr Asp Arg Tyr Val Ala Asn Cys 115 120 125 His Pro Leu Cys Tyr Thr Val Ile Val Asn His Arg Leu Cys Ile Leu 130 135 140 Leu Leu Leu Leu Ser Trp Val Ile Ser Ile Phe His Ala Phe Ile Gln 145 150 155 160 Ser Leu Ile Val Leu Gln Leu Thr Phe Cys Gly Asp Val Lys Ile Pro 165 170 175 His Phe Phe Cys Glu Leu Asn Gln Leu Ser Gln Leu Thr Cys Ser Asp 180 185 190 Asn Phe Pro Ser His Leu Ile Met Asn Leu Val Pro Val Met Leu Ala 195 200 205 Ala Ile Ser Phe Ser Gly Ile Leu Tyr Ser Tyr Phe Lys Ile Val Ser 210 215 220 Ser Ile His Ser Ile Ser Thr Val Gln Gly Lys Tyr Lys Ala Phe Ser 225 230 235 240 Thr Cys Ala Ser His Leu Ser Ile Val Ser Leu Phe Tyr Ser Thr Gly 245 250 255 Leu Gly Val Tyr Val Ser Ser Ala Val Val Gln Ser Ser His Ser Ala 260 265 270 Ala Ser Ala Ser Val Met Tyr Thr Val Val Thr Pro Met Leu Asn Pro 275 280 285 Phe Ile Tyr Ser Leu Arg Asn Lys Asp Val Lys Arg Ala Leu Glu Arg 290 295 300 Leu Leu Glu Gly Asn Cys Lys Val His His Trp Thr Gly 305 310 315 75 310 PRT Rattus sp. I3 75 Met Asn Asn Gln Thr Phe Ile Thr Gln Phe Leu Leu Leu Gly Leu Pro 1 5 10 15 Ile Pro Glu Glu His Gln His Leu Phe Tyr Ala Leu Phe Leu Val Met 20 25 30 Tyr Leu Thr Thr Ile Leu Gly Asn Leu Leu Ile Ile Val Leu Val Gln 35 40 45 Leu Asp Ser Gln Leu His Thr Pro Met Tyr Leu Phe Leu Ser Asn Leu 50 55 60 Ser Phe Ser Asp Leu Cys Phe Ser Ser Val Thr Met Pro Lys Leu Leu 65 70 75 80 Gln Asn Met Arg Ser Gln Asp Thr Ser Ile Pro Tyr Gly Gly Cys Leu 85 90 95 Ala Gln Thr Tyr Phe Phe Met Val Phe Gly Asp Met Glu Ser Phe Leu 100 105 110 Leu Val Ala Met Ala Tyr Asp Arg Tyr Val Ala Ile Cys Phe Pro Leu 115 120 125 His Tyr Thr Ser Ile Met Ser Pro Lys Leu Cys Thr Cys Leu Val Leu 130 135 140 Leu Leu Trp Met Leu Thr Thr Ser His Ala Met Met His Thr Leu Leu 145 150 155 160 Ala Ala Arg Leu Ser Phe Cys Glu Asn Asn Val Val Leu Asn Phe Phe 165 170 175 Cys Asp Leu Phe Val Leu Leu Lys Leu Ala Cys Ser Asp Thr Tyr Ile 180 185 190 Asn Glu Leu Met Ile Phe Ile Met Ser Thr Leu Leu Ile Ile Ile Pro 195 200 205 Phe Phe Leu Ile Val Met Ser Tyr Ala Arg Ile Ile Ser Ser Ile Leu 210 215 220 Lys Val Pro Ser Thr Gln Gly Ile Cys Lys Val Phe Ser Thr Cys Gly 225 230 235 240 Ser His Leu Ser Val Val Ser Leu Phe Tyr Gly Thr Ile Ile Gly Leu 245 250 255 Tyr Leu Cys Pro Ala Gly Asn Asn Ser Thr Val Lys Glu Met Val Met 260 265 270 Ala Met Met Tyr Thr Val Val Thr Pro Met Leu Asn Pro Phe Ile Tyr 275 280 285 Ser Leu Arg Asn Arg Asp Met Lys Arg Ala Leu Ile Arg Val Ile Cys 290 295 300 Ser Met Lys Ile Thr Leu 305 310 76 327 PRT Rattus sp. I7 76 Met Glu Arg Arg Asn His Ser Gly Arg Val Ser Glu Phe Val Leu Leu 1 5 10 15 Gly Phe Pro Ala Pro Ala Pro Leu Arg Val Leu Leu Phe Phe Leu Ser 20 25 30 Leu Leu Asp Tyr Val Leu Val Leu Thr Glu Asn Met Leu Ile Ile Ile 35 40 45 Ala Ile Arg Asn His Pro Thr Leu His Lys Pro Met Tyr Phe Phe Leu 50 55 60 Ala Asn Met Ser Phe Leu Glu Ile Trp Tyr Val Thr Val Thr Ile Pro 65 70 75 80 Lys Met Leu Ala Gly Phe Ile Gly Ser Lys Glu Asn His Gly Gln Leu 85 90 95 Ile Ser Phe Glu Ala Cys Met Thr Gln Leu Tyr Phe Phe Leu Gly Leu 100 105 110 Gly Cys Thr Glu Cys Val Leu Leu Ala Val Met Ala Tyr Asp Arg Tyr 115 120 125 Val Ala Ile Cys His Pro Leu His Tyr Pro Val Ile Val Ser Ser Arg 130 135 140 Leu Cys Val Gln Met Ala Ala Gly Ser Trp Ala Gly Gly Phe Gly Ile 145 150 155 160 Ser Met Val Lys Val Phe Leu Ile Ser Arg Leu Ser Tyr Cys Gly Pro 165 170 175 Asn Thr Ile Asn His Phe Phe Cys Asp Val Ser Pro Leu Leu Asn Leu 180 185 190 Ser Cys Thr Asp Met Ser Thr Ala Glu Leu Thr Asp Phe Val Leu Ala 195 200 205 Ile Phe Ile Leu Leu Gly Pro Leu Ser Val Thr Gly Ala Ser Tyr Met 210 215 220 Ala Ile Thr Gly Ala Val Met Arg Ile Pro Ser Ala Ala Gly Arg His 225 230 235 240 Lys Ala Phe Ser Thr Cys Ala Ser His Leu Thr Val Val Ile Ile Phe 245 250 255 Tyr Ala Ala Ser Ile Phe Ile Tyr Ala Arg Pro Lys Ala Leu Ser Ala 260 265 270 Phe Asp Thr Asn Lys Leu Val Ser Val Leu Tyr Ala Val Ile Val Pro 275 280 285 Leu Phe Asn Pro Ile Ile Tyr Cys Leu Arg Asn Gln Asp Val Lys Arg 290 295 300 Ala Leu Arg Arg Thr Leu His Leu Ala Gln Asp Gln Glu Ala Asn Thr 305 310 315 320 Asn Lys Gly Ser Lys Ile Gly 325 77 312 PRT Rattus sp. I8 77 Met Asn Asn Lys Thr Val Ile Thr His Phe Leu Leu Leu Gly Leu Pro 1 5 10 15 Ile Pro Pro Glu His Gln Gln Leu Phe Phe Ala Leu Phe Leu Ile Met 20 25 30 Tyr Leu Thr Thr Phe Leu Gly Asn Leu Leu Ile Val Val Leu Val Gln 35 40 45 Leu Asp Ser His Leu His Thr Pro Met Tyr Leu Phe Leu Ser Asn Leu 50 55 60 Ser Phe Ser Asp Leu Cys Phe Ser Ser Val Thr Met Leu Lys Leu Leu 65 70 75 80 Gln Asn Ile Gln Ser Gln Val Pro Ser Ile Ser Tyr Ala Gly Cys Leu 85 90 95 Thr Gln Ile Phe Phe Phe Leu Leu Phe Gly Tyr Leu Gly Asn Phe Leu 100 105 110 Leu Val Ala Met Ala Tyr Asp Arg Tyr Val Ala Ile Cys Phe Pro Leu 115 120 125 His Tyr Thr Asn Ile Met Ser His Lys Leu Cys Thr Cys Leu Leu Leu 130 135 140 Val Phe Trp Ile Met Thr Ser Ser His Ala Met Met His Thr Leu Leu 145 150 155 160 Ala Ala Arg Leu Ser Phe Cys Glu Asn Asn Val Leu Leu Asn Phe Phe 165 170 175 Cys Asp Leu Phe Val Leu Leu Lys Leu Ala Cys Ser Asp Thr Tyr Val 180 185 190 Asn Glu Leu Met Ile His Ile Met Gly Val Ile Ile Ile Val Ile Pro 195 200 205 Phe Val Leu Ile Val Ile Ser Tyr Ala Lys Ile Ile Ser Ser Ile Leu 210 215 220 Lys Val Pro Ser Thr Gln Ser Ile His Lys Val Phe Ser Thr Cys Gly 225 230 235 240 Ser His Leu Ser Val Val Ser Leu Phe Tyr Gly Thr Ile Ile Gly Leu 245 250 255 Tyr Leu Cys Pro Ser Gly Asp Asn Phe Ser Leu Lys Gly Ser Ala Met 260 265 270 Ala Met Met Tyr Thr Val Val Thr Pro Met Leu Asn Pro Phe Ile Tyr 275 280 285 Ser Leu Arg Asn Arg Asp Met Lys Gln Ala Leu Ile Arg Val Thr Cys 290 295 300 Ser Lys Lys Ile Ser Leu Pro Trp 305 310 78 314 PRT Rattus sp. I9 78 Met Thr Arg Arg Asn Gln Thr Ala Ile Ser Gln Phe Phe Leu Leu Gly 1 5 10 15 Leu Pro Phe Pro Pro Glu Tyr Gln His Leu Phe Tyr Ala Leu Phe Leu 20 25 30 Ala Met Tyr Leu Thr Thr Leu Leu Gly Asn Leu Ile Ile Ile Ile Leu 35 40 45 Ile Leu Leu Asp Ser His Leu His Thr Pro Met Tyr Leu Phe Leu Ser 50 55 60 Asn Leu Ser Phe Ala Asp Leu Cys Phe Ser Ser Val Thr Met Pro Lys 65 70 75 80 Leu Leu Gln Asn Met Gln Ser Gln Val Pro Ser Ile Pro Tyr Ala Gly 85 90 95 Cys Leu Ala Gln Ile Tyr Phe Phe Leu Phe Phe Gly Asp Leu Gly Asn 100 105 110 Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val Ala Ile Cys Phe 115 120 125 Pro Leu His Tyr Met Ser Ile Met Ser Pro Lys Leu Cys Val Ser Leu 130 135 140 Val Val Leu Ser Trp Val Leu Thr Thr Phe His Ala Met Leu His Thr 145 150 155 160 Leu Leu Met Ala Arg Leu Ser Phe Cys Glu Asp Ser Val Ile Pro His 165 170 175 Tyr Phe Cys Asp Met Ser Thr Leu Leu Lys Val Ala Cys Ser Asp Thr 180 185 190 His Asp Asn Glu Leu Ala Ile Phe Ile Leu Gly Gly Pro Ile Val Val 195 200 205 Leu Pro Phe Leu Leu Ile Ile Val Ser Tyr Ala Arg Ile Val Ser Ser 210 215 220 Ile Phe Lys Val Pro Ser Ser Gln Ser Ile His Lys Ala Phe Ser Thr 225 230 235 240 Cys Gly Ser His Leu Ser Val Val Ser Leu Phe Tyr Gly Thr Val Ile 245 250 255 Gly Leu Tyr Leu Cys Pro Ser Ala Asn Asn Ser Thr Val Lys Glu Thr 260 265 270 Val Met Ser Leu Met Tyr Thr Met Val Thr Pro Met Leu Asn Pro Phe 275 280 285 Ile Tyr Ser Leu Arg Asn Arg Asp Ile Lys Asp Ala Leu Glu Lys Ile 290 295 300 Met Cys Lys Lys Gln Ile Pro Ser Phe Leu 305 310 79 312 PRT Rattus sp. I14 79 Met Thr Gly Asn Asn Gln Thr Leu Ile Leu Glu Phe Leu Leu Leu Gly 1 5 10 15 Leu Pro Ile Pro Ser Glu Tyr His Leu Leu Phe Tyr Ala Leu Phe Leu 20 25 30 Ala Met Tyr Leu Thr Ile Ile Leu Gly Asn Leu Leu Ile Ile Val Leu 35 40 45 Val Arg Leu Asp Ser His Leu His Met Pro Met Tyr Leu Phe Leu Ser 50 55 60 Asn Leu Ser Phe Ser Asp Leu Cys Phe Ser Ser Val Thr Met Pro Lys 65 70 75 80 Leu Leu Gln Asn Met Gln Ser Gln Val Pro Ser Ile Ser Tyr Thr Gly 85 90 95 Cys Leu Thr Gln Leu Tyr Phe Phe Met Val Phe Gly Asp Met Glu Ser 100 105 110 Phe Leu Leu Val Val Met Ala Tyr Asp Arg Tyr Val Ala Ile Cys Phe 115 120 125 Pro Leu Arg Tyr Thr Thr Ile Met Ser Thr Lys Phe Cys Ala Ser Leu 130 135 140 Val Leu Leu Leu Trp Met Leu Thr Met Thr His Ala Leu Leu His Thr 145 150 155 160 Leu Leu Ile Ala Arg Leu Ser Phe Cys Glu Lys Asn Val Ile Leu His 165 170 175 Phe Phe Cys Asp Ile Ser Ala Leu Leu Lys Leu Ser Cys Ser Asp Ile 180 185 190 Tyr Val Asn Glu Leu Met Ile Tyr Ile Leu Gly Gly Leu Ile Ile Ile 195 200 205 Ile Pro Phe Leu Leu Ile Val Met Ser Tyr Val Arg Ile Phe Phe Ser 210 215 220 Ile Leu Lys Phe Pro Ser Ile Gln Asp Ile Tyr Lys Val Phe Ser Thr 225 230 235 240 Cys Gly Ser His Leu Ser Val Val Thr Leu Phe Tyr Gly Thr Ile Phe 245 250 255 Gly Ile Tyr Leu Cys Pro Ser Gly Asn Asn Ser Thr Val Lys Glu Ile 260 265 270 Ala Met Ala Met Met Tyr Thr Val Val Thr Pro Met Leu Asn Pro Phe 275 280 285 Ile Tyr Ser Leu Arg Asn Arg Asp Met Lys Arg Ala Leu Ile Arg Val 290 295 300 Ile Cys Thr Lys Lys Ile Ser Leu 305 310 80 314 PRT Rattus sp. I15 80 Met Thr Glu Glu Asn Gln Thr Val Ile Ser Gln Phe Leu Leu Leu Phe 1 5 10 15 Leu Pro Ile Pro Ser Glu His Gln His Val Phe Tyr Ala Leu Phe Leu 20 25 30 Ser Met Tyr Leu Thr Thr Val Leu Gly Asn Leu Ile Ile Ile Ile Leu 35 40 45 Ile His Leu Asp Ser His Leu His Thr Pro Met Tyr Leu Phe Leu Ser 50 55 60 Asn Leu Ser Phe Ser Asp Leu Cys Phe Ser Ser Val Thr Met Pro Lys 65 70 75 80 Leu Leu Gln Asn Met Gln Ser Gln Val Pro Ser Ile Pro Phe Ala Gly 85 90 95 Cys Leu Thr Gln Leu Tyr Phe Tyr Leu Tyr Phe Ala Asp Leu Glu Ser 100 105 110 Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val Ala Ile Cys Phe 115 120 125 Pro Leu His Tyr Met Ser Ile Met Ser Pro Lys Leu Cys Val Ser Leu 130 135 140 Val Val Leu Ser Trp Val Leu Thr Thr Phe His Ala Met Leu His Thr 145 150 155 160 Leu Leu Met Ala Arg Leu Ser Phe Cys Ala Asp Asn Met Ile Pro His 165 170 175 Phe Phe Cys Asp Ile Ser Pro Leu Leu Lys Leu Ser Cys Ser Asp Thr 180 185 190 His Val Asn Glu Leu Val Ile Phe Val Met Gly Gly Leu Val Ile Val 195 200 205 Ile Pro Phe Val Leu Ile Ile Val Ser Tyr Ala Arg Val Val Ala Ser 210 215 220 Ile Leu Lys Val Pro Ser Val Arg Gly Ile His Lys Ile Phe Ser Thr 225 230 235 240 Cys Gly Ser His Leu Ser Val Val Ser Leu Phe Tyr Gly Thr Ile Ile 245 250 255 Gly Leu Tyr Leu Cys Pro Ser Ala Asn Asn Ser Thr Val Lys Glu Thr 260 265 270 Val Met Ala Met Met Tyr Thr Val Val Thr Pro Met Leu Asn Pro Phe 275 280 285 Ile Tyr Ser Leu Arg Asn Arg Asp Met Lys Glu Ala Leu Ile Arg Val 290 295 300 Leu Cys Lys Lys Lys Ile Thr Phe Cys Leu 305 310 

What is claimed is:
 1. An isolated nucleic acid molecule encoding an odorant receptor protein, wherein the receptor protein comprises seven transmembrane domains, and is further characterized by at least one of the following characteristics: (a) the loop between the first transmembrane domain and the second transmembrane domain, and the second transmembrane domain together comprise consecutive amino acids having the following sequence: -L, X, X, P, M, Y, X, F, L- (SEQ ID NO: 55); (b) the third transmembrane domain, and the loop between the third transmembrane domain and the fourth transmembrane domain together comprise consecutive amino acids having one of the following sequences: -M, X, Y, D, R, X, X, A, I, C- (SEQ ID NO: 57); or -D, R, X, X, A, I, C- (SEQ ID NO: 59); (c) the loop between the fifth transmembrane domain and the sixth transmembrane domain, and the sixth transmembrane domain together comprise consecutive amino acids having one of the following sequences: -K or R, X, F, S, T, C, X, S, H- (SEQ ID NO: 61); or -F, S, T, C, X, S, H- (SEQ ID NO: 63); or (d) the seventh transmembrane domain and the C-terminal domain together comprise consecutive amino acids having one of the following sequences: -P, X, X, N, P, X, I, Y, X, L, R, N- (SEQ ID NO: 65); or -P, X, X, N, P, X, I, Y- (SEQ ID NO: 67); or -N, P, X, I, Y, X, L, R, N- (SEQ ID NO: 69); wherein X is any amino acid.
 2. The isolated nucleic acid molecule of claim 1 wherein: (a) the loop between the first transmembrane domain and the second transmembrane domain, and the second transmembrane domain together comprise consecutive amino acids having the following sequence: -L, H or Q, K or M or T, PMY, F or L, FL- (SEQ ID NO: 56); (b) the third transmembrane domain, and the loop between the third transmembrane domain and the fourth transmembrane domain together comprise consecutive amino acids having one of the following sequences: -M, A or S, YDR, F or Y, L or V, AIC- (SEQ ID NO: 58); or -DR, F or Y, L or V, AIC- (SEQ ID NO: 60); (c) the loop between the fifth transmembrane domain and the sixth transmembrane domain, and the sixth transmembrane domain together comprise consecutive amino acids having one of the following sequences: -K or R, A or I or S or V, FSTC, A or G or S, SH- (SEQ ID NO: 62); or -FSTC, A or G or S, SH- (SEQ ID NO: 64); or (d) the seventh transmembrane domain and the C-terminal domain together comprise consecutive amino acids having one of the following sequences: -P, M or L or V, F or L or V, NP, F or I, IY, C or S or T, LRN- (SEQ ID NO: 66); or -P, M or L or V, F or L or V, NP, F or I, IY- (SEQ ID NO: 68); or -NP, F or I, IY, C or S or T, LRN- (SEQ ID NO: 70).
 3. The isolated nucleic acid molecule of claim 1, wherein the receptor protein is characterized by at least two of the characteristics (a) through (d).
 4. The isolated nucleic acid molecule of claim 1, wherein the receptor protein is characterized by at least three of the characteristics (a) through (d).
 5. The isolated nucleic acid molecule of claim 1, wherein the receptor protein is characterized by all of the characteristics (a) through (d).
 6. An isolated nucleic acid molecule encoding an odorant receptor protein, wherein the nucleic acid molecule encodes a protein selected from the group consisting of: (a) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with tyrosine at position 333 as set forth in row F3 of FIGS. 4A to 4M (SEQ ID NO: 71), (b) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with glutamine at position 313 as set forth in row F5 of FIGS. 4A to 4L (SEQ ID NO: 72), (c) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with lysine at position 311 as set forth in row F6 of FIGS. 4A to 4L (SEQ ID NO: 73), (d) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with glycine at position 317 as set forth in row F12 of FIGS. 4A to 4L (SEQ ID NO: 74), (e) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with leucine at position 310 as set forth in row I3 of FIGS. 4A to 4L (SEQ ID NO: 75), (f) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with glycine at position 327 as set forth in row I7 of FIGS. 4A to 4L (SEQ ID NO: 76), (g) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with tryptophan at position 312 as set forth in row I8 of FIGS. 4A to 4L (SEQ ID NO: 77), (h) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with leucine at position 314 as set forth in row I9 of FIGS. 4A to 4L (SEQ ID NO: 78), (i) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with leucine at position 312 as set forth in row I14 of FIGS. 4A to 4L (SEQ ID NO: 79), (j) an odorant receptor protein comprising consecutive amino acids having a sequence identical to that beginning with methionine at position 1 and ending with leucine at position 314 as set forth in row I15 of FIGS. 4A to 4L (SEQ ID NO: 80), and (k) an odorant receptor protein that shares from 40-80% amino acid identity with any one of the proteins of (a)-(j), comprises seven transmembrane domains, and is further characterized by at least one of the following characteristics: (i) the loop between the first transmembrane domain and the second transmembrane domain, and the second transmembrane domain together comprise consecutive amino acids having the following sequence: -L, X, X, P, M, Y, X, F, L- (SEQ ID NO: 55); (ii) the third transmembrane domain, and the loop between the third transmembrane domain and the fourth transmembrane domain together comprise consecutive amino acids having one of the following sequences: -M, X, Y, D, R, X, X, A, I, C- (SEQ ID NO: 57); or -D, R, X, X, A, I, C- (SEQ ID NO: 59); (iii) the loop between the fifth transmembrane domain and the sixth transmembrane domain, and the sixth transmembrane domain together comprise consecutive amino acids having one of the following sequences: -K or R, X, F, S, T, C, X, S, H- (SEQ ID NO: 61); or -F, S, T, C, X, S, H- (SEQ ID NO: 63); or (iv) the seventh transmembrane domain and the C-terminal domain together comprise consecutive amino acids having one of the following sequences: -P, X, X, N, P, X, I, Y, X, L, R, N- (SEQ ID NO: 65); or -P, X, X, N, P, X, I, Y- (SEQ ID NO: 67); or -N, P, X, I, Y, X, L, R, N- (SEQ ID NO: 69); wherein X is any amino acid.
 7. The isolated nucleic acid molecule of claim 6 wherein: (i) the loop between the first transmembrane domain and the second transmembrane domain, and the second transmembrane domain together comprise consecutive amino acids having the following sequence: -L, H or Q, K or M or T, PMY, F or L, FL- (SEQ ID NO: 56); (ii) the third transmembrane domain, and the loop between the third transmembrane domain and the fourth transmembrane domain together comprise consecutive amino acids having one of the following sequences: -M, A or S, YDR, F or Y, L or V, AIC- (SEQ ID NO: 58); or -DR, F or Y, L or V, AIC- (SEQ ID NO: 60); (iii) the loop between the fifth transmembrane domain and the sixth transmembrane domain, and the sixth transmembrane domain together comprise consecutive amino acids having one of the following sequences: -K or R, A or I or S or V, FSTC, A or G or S, SH- (SEQ ID NO: 62); or -FSTC, A or G or S, SH- (SEQ ID NO: 64); or (iv) the seventh transmembrane domain and the C-terminal domain together comprise consecutive amino acids having one of the following sequences: -P, M or L or V, F or L or V, NP, F or I, IY, C or S or T, LRN- (SEQ ID NO: 66); or -P, M or L or V, F or L or V, NP, F or I, IY- (SEQ ID NO: 68); or -NP, F or I, IY, C or S or T, LRN- (SEQ ID NO: 70).
 8. An isolated nucleic acid molecule encoding an odorant receptor protein, wherein the nucleic acid molecule comprises a nucleic acid sequence which can be amplified by polymerase chain reaction using: (a) any one of 5′ primers Al (SEQ ID NO: 37), A2 (SEQ ID NO: 38), A3 (SEQ ID NO: 39), A4 (SEQ ID NO: 40), or A5 (SEQ ID NO: 41); and (b) any one of 3′ primers E1 (SEQ ID NO: 42), B2 (SEQ ID NO: 43), B3 (SEQ ID NO: 44), B4 (SEQ ID NO: 45), B5 (SEQ ID NO: 46), or B6 (SEQ ID NO: 47).
 9. An isolated nucleic acid molecule encoding an odorant receptor protein, wherein the nucleic acid molecule comprises: (a) a nucleic acid sequence given in any one of FIGS. 9 to 31 (SEQ ID Nos: 1-10, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, or 35); or (b) a nucleic acid sequence degenerate to a sequence of (a) as a result of the genetic code.
 10. The isolated nucleic acid molecule of claim 8 or 9, wherein the odorant receptor protein comprises seven transmembrane domains.
 11. The isolated nucleic acid molecule of claim 1, wherein the loop between the fifth and sixth transmembrane domains consists of 17 amino acids.
 12. The isolated nucleic acid molecule of claim 10, wherein the loop between the fifth and sixth transmembrane domains consists of 17 amino acids.
 13. The isolated nucleic acid molecule of any one of claims 1, 6, 8, or 9, wherein the odorant receptor is a vertebrate odorant receptor.
 14. The isolated nucleic acid molecule of claim 13, wherein the vertebrate odorant receptor is a fish odorant receptor or a mammalian odorant receptor.
 15. The isolated nucleic acid molecule of claim 14, wherein the mammalian odorant receptor is a human odorant receptor, a rat odorant receptor, a mouse odorant receptor or a dog odorant receptor.
 16. The isolated nucleic acid molecule of claim 1, 6, 8, or 9, wherein the nucleic acid is DNA.
 17. The isolated nucleic acid molecule of claim 16, wherein the DNA is cDNA.
 18. A vector comprising the isolated nucleic acid molecule of claim 1, 6, 8, or
 9. 19. The vector of claim 18, wherein the vector additionally comprises elements necessary for replication and expression in a suitable host.
 20. A purified odorant receptor protein encoded by the isolated nucleic acid molecule of claim 1, 6, 8, or
 9. 21. A cell transfected with the vector of claim
 19. 22. The cell of claim 21, wherein the cell is an olfactory cell.
 23. The cell of claim 21, wherein the cell is a non-olfactory cell.
 24. The cell of claim 23, wherein prior to being transfected with the vector the non-olfactory cell does not express an odorant receptor protein.
 25. A method of identifying a desired odorant ligand, which comprises contacting a non-olfactory cell of claim 23, which express on its cell surface a known odorant receptor, with a series of odorant ligands and determining which ligands bind to the known odorant receptor on the non-olfactory cell.
 26. A method of identifying a desired odorant receptor, which comprises contacting a series of non-olfactory cells of claim 23 with a known odorant ligand and determining which odorant receptor binds with the odorant ligand.
 27. A method of detecting an odor which comprises: (a) identifying an odorant receptor which binds the desired odorant ligand identified by the method of claim 26; and (b) imbedding the receptor in a membrane such that when the odorant ligand binds with the receptor identified in (a) above, a detectable signal is produced.
 28. The method of claim 27 wherein the desired odorant ligand is a pheromone.
 29. The method of claim 27 wherein the desired odorant ligand is the vapor emanating from cocaine, marijuana, heroin, hashish, angel dust, gasoline, natural gas, alcohol, decayed human flesh, gun powder, an explosive, a plastic explosive, or a firearm.
 30. The method of claim 27 wherein the desired odorant ligand is a toxic fume, a noxious fume or a dangerous fume.
 31. The method of claim 27 wherein the membrane is a cell membrane, an olfactory cell membrane, or a synthetic membrane.
 32. The method of claim 27 wherein the detectable signal is a color change, a phosphorescence, a radioactivity, a visual signal, or an auditory signal.
 33. A method of quantifying the amount of an odorant ligand present in a sample which comprises the method of claim 27 wherein the detectable signal is quantified.
 34. A method of developing fragrances, which comprises identifying a desired odorant receptor by the method of claim 26, then contacting a non-olfactory cell, which has been transfected with an expression vector comprising an isolated nucleic acid molecule encoding the desired odorant receptor such that the receptor is expressed upon the surface of the non-olfactory cell, with a series of compounds to determine which compounds bind with the receptor.
 35. A method of identifying an odorant fingerprint, which comprises contacting a series of cells, which have been transformed such that each express a known odorant receptor encoded by a nucleic acid molecule of any one of claims 1, 6, 8, or 9, with a desired sample containing one or more odorant ligand and determining the type and quantity of the odorant ligands present in the sample.
 36. A method of identifying a compound which inhibits an odorant receptor, which comprises contacting an odorant receptor encoded by the nucleic acid molecule of any one of claims 1, 6, 8, or 9 with a series of compounds and determining which compound inhibits interaction between the odorant receptor and an odorant ligand known to bind to the odorant receptor.
 37. A method for identifying an appetite suppressant compound, which comprises identifying a compound by the method of claim 36 wherein the odorant receptor is associated with the perception of food.
 38. A pharmaceutical composition comprising a compound identified by the method of claim 37 and a pharmaceutically acceptable carrier.
 39. A nasal spray for controlling appetite, which comprises a compound identified by the method of claim 37 in a suitable carrier.
 40. A method for controlling appetite in a subject, which comprises administering to the subject an amount of a compound identified by the method of claim 37 effective to control the subject's appetite.
 41. The method of claim 40, which comprises administering the compound to the subject's olfactory epithelium.
 42. A method of trapping odors, which comprises contacting a membrane comprising a plurality of a desired odorant receptor encoded by the nucleic acid molecule of any one of claims 1, 6, 8, or 9 with a sample comprising a desired odorant ligand such that the desired odorant ligand is absorbed by the binding of the odorant ligand to the odorant receptor.
 43. An odor trap, which comprises a membrane comprising a plurality of a desired odorant receptor encoded by the nucleic acid molecule of any one of claims 1, 6, 8, or 9, such that a desired odorant ligand is absorbed by the binding of the odorant ligand to the odorant receptor.
 44. A method for controlling a pest population in an area, which comprises spraying the area with an odorant receptor ligand identified by the method of claim
 25. 45. The method of claim 44, wherein the odorant ligand is an alarm odorant ligand.
 46. The method of claim 44, wherein the odorant ligand interferes with an interaction between an odorant ligand and an odorant receptor associated with fertility.
 47. The method of claim 44, wherein the pest population is a population of rodents, mice, or rats.
 48. A pharmaceutical composition comprising an odorant ligand identified by the method of claim 25 and a pharmaceutically acceptable carrier.
 49. A method of promoting fertility in a subject which comprises administering to the subject an amount of an odorant ligand identified by the method of claim 25 effective to promote the subject's fertility.
 50. The method of claim 49, wherein the odorant ligand interacts with an odorant receptor associated with fertility.
 51. A method of inhibiting fertility in a subject which comprises administering to the subject an amount of an odorant ligand identified by the method of claim 25 effective to inhibit the subject's fertility.
 52. The method of claim 51, wherein the odorant ligand inhibits an interaction between an odorant ligand and an odorant receptor associated with fertility.
 53. The method of claim 49 or 51, which comprises administering the odorant ligand to the subject's olfactory epithelium.
 54. Use of an odorant ligand identified by the method of claim 25 for the preparation of a pharmaceutical composition for controlling a pest population in a desired area by spraying the desired area with the identified odorant ligand.
 55. The use of claim 54, wherein the odorant ligand is an alarm odorant ligand.
 56. Use of an odorant ligand identified by the method of claim 25 for the preparation of a pharmaceutical composition for controlling a pest population.
 57. The use of claim 56, wherein the odorant ligand interferes with the interaction between odorant ligands and odorant receptors which are associated with fertility.
 58. Use of any one of claims 54 to 57 wherein the pest population is a population of rodents, mice, or rats.
 59. Use of an odorant ligand identified by the method of claim 25 for the preparation of a pharmaceutical composition for promoting fertility.
 60. The use of claim 59, wherein the odorant ligand interacts with odorant receptors associated with fertility.
 61. Use of an odorant ligand identified by the method of claim 25 for the preparation of a pharmaceutical composition for inhibiting fertility.
 62. The use of claim 61, wherein the odorant ligand inhibits the interaction between odorant ligands and odorant receptors associated with fertility.
 63. Use of the compound identified by the method of claim 37 for the preparation of a pharmaceutical composition for controlling appetite in a subject. 